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Preface 



Adaptive control has been extensively investigated and developed in both 
theory and application during the past few decades, and it is still a very active 
research field. In the earlier stage, most studies in adaptive control concen- 
trated on linear systems. A remarkable development of the adaptive control 
theory is the resolution of the so-called ideal problem, that is, the proof that 
several adaptive control systems are globally stable under certain ideal con- 
ditions. Then the robustness issues of adaptive control with respect to non- 
ideal conditions such as external disturbances and unmodelled dynamics were 
addressed which resulted in many different robust adaptive control algorithms. 
These robust algorithms include dead zone, normalization, e-modification, e\- 
modification among many others. At the same time, extensive study has been 
carried out for reducing a priori knowledge of the systems and improving the 
transient performance of adaptive control systems. Most recently, adaptive 
control of nonlinear systems has received great attention and a number of 
significant results have been obtained. 

In this book, we have compiled some of the most recent developments of 
adaptive control for both linear and nonlinear systems from leading world 
researchers in the field. These include various robust techniques, performance 
enhancement techniques, techniques with less a priori knowledge, adaptive 
switching techniques, nonlinear adaptive control techniques and intelligent 
adaptive control techniques. Each technique described has been developed to 
provide a practical solution to a real-life problem. This volume will therefore 
not only advance the field of adaptive control as an area of study, but will also 
show how the potential of this technology can be realized and offer significant 
benefits. 

The first contribution in this book is ‘Adaptive internal model control’ by A. 
Datta and L. Xing. It develops a systematic theory for the design and analysis 
of adaptive internal model control schemes. The ubiquitous certainty equiva- 
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lence principle of adaptive control is used to combine a robust adaptive law 
with robust internal model controllers to obtain adaptive internal model 
control schemes which can be proven to be robustly stable. Specific controller 
structures considered include those of the model reference, partial pole 
placement, and Hi and H x optimal control types. The results here not only 
provide a theoretical basis for analytically justifying some of the reported 
industrial successes of existing adaptive internal model control schemes but 
also open up the possibility of synthesizing new ones by simply combining a 
robust adaptive law with a robust internal model controller structure. 

The next contribution is ‘An algorithm for robust direct adaptive control 
with less prior knowledge’ by G. Feng, Y. A. Jiang and R. Zmood. It discusses 
several approaches to minimizing a priori knowledge required on the unknown 
plants for robust adaptive control. It takes a discrete time robust direct 
adaptive control algorithm with a dead zone as an example. It shows that 
for a class of unmodelled dynamics and bounded disturbances, no knowledge 
of the parameters of the upper bounding function on the unmodelled dynamics 
and disturbances is required a priori. Furthermore it shows that a correction 
procedure can be employed in the least squares estimation algorithm so that no 
knowledge of the lower bound on the leading coefficient of the plant numerator 
polynomial is required to achieve the singularity free adaptive control law. The 
global stability and convergence results of the algorithm are established. 

The next contribution is ‘Adaptive variable structure control’ by C. J. 
Chiang and Lichen Fu. A unified algorithm is presented to develop the variable 
structure MRAC for an SISO system with unmodelled dynamics and output 
measurement noises. The proposed algorithm solves the robustness and 
performance problem of the traditional MRAC with arbitrary relative 
degree. It is shown that without any persistent excitation the output tracking 
error can be driven to zero for relative degree-one plants and driven to a small 
residual set asymptotically for plants with any higher relative degree. 
Furthermore, under suitable choice of initial conditions on control parameters, 
the tracking performance can be improved, which is hardly achievable by the 
traditional MRAC schemes, especially for plants with uncertainties. 

The next contribution is ‘Indirect adaptive periodic control’ by D. 
Dimogianopoulos, R. Lozano and A. Ailon. This new, indirect adaptive 
control method is based on a lifted representation of the plant which can be 
stabilized using a simple performant periodic control scheme. The controller 
parameters computation involves the inverse of the controllability/observa- 
bility matrix. Potential singularities of this matrix are avoided by means of an 
appropriate estimates modification. This estimates transformation is linked to 
the covariance matrix properties and hence it preserves the convergence 
properties of the original estimates. This modification involves the singular 
value decomposition of the controllability/observability matrix’s estimate. As 
compared to previous studies in the subject the controller proposed here does 
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not require the frequent introduction of periodic //-length sequences of zero 
inputs. Therefore the new controller is such that the system is almost always 
operating in closed loop which should lead to better performance 
characteristics. 

The next contribution is ‘Adaptive stabilization of uncertain discrete-time 
systems via switching control: the method of localization’ by P. V. 
Zhivoglyadov, R. Middleton and M. Fu. It presents a new systematic switching 
control approach to adaptive stabilization of uncertain discrete-time systems. 
The approach is based on a method of localization which is conceptually 
different from supervisory adaptive control schemes and other existing switch- 
ing control schemes. The proposed approach allows for slow parameter 
drifting, infrequent large parameter jumps and unknown bound on exogenous 
disturbances. The unique feature of the localization-based switching adaptive 
control proposed here is its rapid model falsification capability. In the LTI case 
this is manifested in the ability of the switching controller to quickly converge 
to a suitable stabilizing controller. It is believed that the approach is applicable 
to a wide class of linear time invariant and time-varying systems with good 
transient performance. 

The next contribution is ‘Adaptive nonlinear control: passivation and small 
gain techniques’ by Z. P. Jiang and D. Hill. It proposes methods to system- 
atically design stabilizing adaptive controllers for new classes of nonlinear 
systems by using passivation and small gain techniques. It is shown that for a 
class of linearly parametrized nonlinear systems with only unknown param- 
eters, the concept of adaptive passivation can be used to unify and extend most 
of the known adaptive nonlinear control algorithms based on Lyapunov 
methods. A novel recursive robust adaptive control method by means of 
backstepping and small gain techniques is also developed to generate a new 
class of adaptive nonlinear controllers with robustness to nonlinear un- 
modelled dynamics. 

The next contribution is ‘Active identification for control of discrete-time 
uncertain nonlinear systems’ by J. Zhao and I. Kanellakopoulos. A novel 
approach is proposed to remove the restrictive growth conditions of the 
nonlinearities and to yield global stability and tracking for systems that can 
be transformed into an output-feedback canonical form. The main novelties of 
the design are (i) the temporal and algorithmic separation of the parameter 
estimation task from the control task and (ii) the development of an active 
identification procedure, which uses the control input to actively drive the 
system state to points in the state space that allow the orthogonalized 
projection estimator to acquire all the necessary information about the 
unknown parameters. It is proved that the proposed algorithm guarantees 
complete identification in a finite time interval and global stability and 
tracking. 




xviii Preface 



The next contribution is ‘Optimal adaptive tracking for nonlinear systems’ 
by M. Krstic and Z. H. Li. In this chapter an ‘inverse optimal’ adaptive 
tracking problem for nonlinear systems with unknown parameters is defined 
and solved. The basis of the proposed method is an adaptive tracking control 
Lyapunov function (atclf) whose existence guarantees the solvability of the 
inverse optimal problem. The controllers designed are not of certainty 
equivalence type. Even in the linear case they would not be a result of solving 
a Riccati equation for a given value of the parameter estimate. Inverse 
optimality is combined with backstepping to design a new class of adaptive 
controllers for strict-feedback systems. These controllers solve a problem left 
open in the previous adaptive backstepping designs - getting transient per- 
formance bounds that include an estimate of control effort. 

The next contribution is ‘Stable adaptive systems in the presence of non- 
linear parameterization’ by A. M. Annaswamy and A. P. Loh. This chapter 
addresses the problem of adaptive control when the unknown parameters 
occur nonlinearly in a dynamic system. The traditional approach used in 
linearly parameterized systems employs a gradient-search principle in estimat- 
ing the unknown parameters. Such an approach is not sufficient for nonlinearly 
parametrized systems. Instead, a new algorithm based on a min-max optimiza- 
tion scheme is developed to address nonlinearly parametrized adaptive systems. 
It is shown that this algorithm results in globally stable closed loop systems 
when the states of the plant are accessible for measurement. 

The next contribution is ‘Adaptive inverse for actuator compensation’ by G. 
Tao. A general adaptive inverse approach is developed for control of plants 
with actuator imperfections caused by nonsmooth nonlinearities such as dead- 
zone, backlash, hysteresis and other piecewise-linear characteristics. An 
adaptive inverse is employed for cancelling the effect of an unknown actuator 
nonlinearity, and a linear feedback control law is used for controlling the 
dynamics of a known linear or smooth nonlinear part following the actuator 
nonlinearity. State feedback and output feedback control designs are presented 
which all lead to linearly parametrized error models suitable for the develop- 
ment of adaptive laws to update the inverse parameters. This approach 
suggests that control systems with commonly used linear or nonlinear feedback 
controllers such as those with an LQ, model reference, PID, pole placement or 
other dynamic compensation design can be combined with an adaptive inverse 
for improving system tracking performance despite the presence of actuator 
imperfections. 

The next contribution is ‘Stable multi-input multi-output adaptive fuzzy/ 
neural control’ by R. Ordonez and K. Passino. In this chapter, stable direct and 
indirect adaptive controllers are presented which use Takagi-Sugeno fuzzy 
systems, conventional fuzzy systems, or a class of neural networks to provide 
asymptotic tracking of a reference signal vector for a class of continuous time 
multi-input multi-output (MIMO) square nonlinear plants with poorly under- 
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stood dynamics. The direct adaptive scheme allows for the inclusion of a priori 
knowledge about the control input in terms of exact mathematical equations or 
linguistics, while the indirect adaptive controller permits the explicit use of 
equations to represent portions of the plant dynamics. It is shown that with or 
without such knowledge the adaptive schemes can ‘learn’ how to control the 
plant, provide for bounded internal signals, and achieve asymptotically stable 
tracking of the reference inputs. No initialization condition needs to be 
imposed on the controllers, and convergence of the tracking error to zero is 
guaranteed. 

The final contribution is ‘Adaptive robust control scheme with an applica- 
tion to PM synchronous motors’ by J. X. Xu, Q. W. Jia and T. H. Lee. A new, 
adaptive, robust control scheme for a class of nonlinear uncertain dynamical 
systems is presented. To reduce the robust control gain and widen the 
application scope of adaptive techniques, the system uncertainties are classified 
into two different categories: the structured and nonstructured uncertainties 
with partially known bounding functions. The structured uncertainty is 
estimated with adaptation and compensated. Meanwhile, the adaptive robust 
method is applied to deal with the non-structured uncertainty by estimating 
unknown parameters in the upper bounding function. It is shown that the new 
control scheme guarantees the uniform boundedness of the system and assures 
the tracking error entering an arbitrarily designated zone in a finite time. The 
effectiveness of the proposed method is demonstrated by the application to PM 
synchronous motors. 
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Adaptive internal model 
control 



A. Datta and L. Xing 



Abstract 

This chapter develops a systematic theory for the design and analysis of 
adaptive internal model control schemes. The principal motivation stems 
from the fact that despite the reported industrial successes of adaptive internal 
model control schemes, there currently does not exist a design methodology 
capable of providing theoretical guarantess of stability and robustness. The 
ubiquitous certainty equivalence principle of adaptive control is used to 
combine a robust adaptive law with robust internal model controllers to 
obtain adaptive internal model control schemes which can be proven to be 
robustly stable. Specific controller structures considered include those of the 
model reference, ‘partial’ pole placement, and H 2 and // x optimal control 
types. The results here not only provide a theoretical basis for analytically 
justifying some of the reported industrial successes of existing adaptive internal 
model control schemes but also open up the possibility of synthesizing new 
ones by simply combining a robust adaptive law with a robust internal model 
controller structure. 



1.1 Introduction 

Internal model control (IMC) schemes, where the controller implementation 
includes an explicit model of the plant, continue to enjoy widespread 
popularity in industrial process control applications [1], Such schemes can 
guarantee internal stability for only open loop stable plants; since most plants 
encountered in process control are anyway open loop stable, this really does 
not impose any significant restriction. 
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As already mentioned, the main feature of IMC is that its implementation 
requires an explicit model of the plant to be used as part of the controller. 
When the plant itself happens to be unknown, or the plant parameters vary 
slowly with time due to ageing, no such model is directly available a priori and 
one has to resort to identification techniques to come up with an appropriate 
plant model on-line. Several empirical studies, e.g. [2], [3] have demonstrated 
the feasibility of such an approach. However, what is, by and large, lacking in 
the process control literature is the availability of results with solid theoretical 
guarantees of stability and performance. 

Motivated by this fact, in [4], [5], we presented designs of adaptive IMC 
schemes with provable guarantees of stability and robustness. The scheme in [4] 
involved on-line adaptation of only the internal model while in [5], in addition 
to adapting the internal model on-line, the IMC parameter was chosen in a 
certainty equivalence fashion to pointwise optimize an Hi performance index. 
In this chapter, it is shown that the approach of [5] can be adapted to design 
and analyse a class of adaptive H x optimal control schemes that are likely to 
arise in process control applications. This class specifically consists of those H ^ 
norm minimization problems that involve only one interpolation constraint. 
Additionally, we reinterpret the scheme of [4] as an adaptive ‘partial' pole- 
placement control scheme and consider the design and analysis of a model 
reference adaptive control scheme based on the IMC structure. In other words, 
this chapter considers the design and analysis of popular adaptive control 
schemes from the literature within the context of the IMC configuration. A 
single, unified, analysis procedure, applicable to each of the schemes con- 
sidered, is also presented. 

The chapter is organized as follows. In Section 1.2, we present several 
nonadaptive control schemes utilizing the IMC configuration. Their adaptive 
certainty equivalence versions are presented in Section 1.3. A unified stability 
and robustness analysis encompassing all of the schemes of Section 1.3 is 
presented in Section 1.4. In Section 1.5, we present simulation examples to 
demonstrate the efficacy of our adaptive IMC designs. Section 1.6 concludes 
the chapter by summarizing the main results and outlining their expected 
significance. 



1.2 Internal model control (IMC) schemes: 
known parameters 

In this section, we present several nonadaptive control schemes utilizing the 
IMC structure. To this end, we consider the IMC configuration for a stable 
plant P(s) as shown in Figure 1.1. The IMC controller consists of a stable 
‘IMC parameter’ Q(s) and a model of the plant which is usually referred to as 
the ‘internal model’. It can be shown [1,4] that if the plant P(s) is stable and 
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Internal model controller 




Figure 1.1 The IMC configuration 

the internal model is an exact replica of the plant, then the stability of the IMC 
parameter is equivalent to the internal stability of the configuration in Figure 
1.1. Indeed, the IMC parameter is really the Youla parameter [6] that appears 
in a special case of the YJBK parametrization of all stabilizing controllers [4]. 
Because of this, internal stability is assured as long as Q(s) is chosen to be any 
stable rational transfer function. We now show that different choices of stable 
Q(s) lead to some familiar control schemes. 

7.2. 7 Partial pole placement control 

From Figure 1.1, it is clear that if the internal model is an exact replica of the 
plant, then there is no feedback signal in the loop. Consequently the poles of 
the closed loop system are made up of the open loop poles of the plant and the 
poles of the IMC parameter Q(s). Thus, in this case, a ‘complete’ pole 
placement as in traditional pole placement control schemes is not possible. 
Instead, one can only choose the poles of the IMC parameter Q(s ) to be in 
some desired locations in the left half plane while leaving the remaining poles at 
the plant open loop pole locations. Such a control scheme, where Q(s') is 
chosen to inject an additional set of poles at some desired locations in the 
complex plane, is referred to as ‘partial' pole placement. 

7.2.2 Model reference control 

The objective in model reference control is to design a differentiator-free 
controller so that the output y of the controlled plant P(s) asymptotically 
tracks the output of a stable reference model I V m (s) for all piecewise 
continuous reference input signals r{t). In order to meet the control objective, 
we make the following assumptions which are by now standard in the model 
reference control literature: 

(Ml) The plant Pis) is minimum phase; and 

(M2) The relative degree of the reference model transfer function I V m (s) is 
greater than or equal to that of the plant transfer function P(s') . 
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Assumption (Ml) above is necessary for ensuring internal stability since 
satisfaction of the model reference control objective requires cancellation of 
the plant zeros. Assumption (M2), on the other hand, permits the design of a 
differentiator-free controller to meet the control objective. If assumptions (Ml) 
and (M2) are satisfied, it is easy to verify from Figure 1.1 that the choice 

Q{s)= W m {s)P~\s) (1.1) 

for the 1MC parameter guarantees the satisfaction of the model reference 
control objective in the ideal case, i.e. in the absence of plant modelling errors. 

1.2.3 Hi optimal control 

In Hi optimal control, one chooses Q(s ) to minimize the L 2 norm of the 
tracking error r — y provided r — y € L 2 . From Figure 1.1, we obtain 

y = nOGMM 

=> r-y= [1 -F(.v)g(^)][r] 

poo 

=> / (r(r) - y(r)) 2 dT = (||[1 - / > (.v)Q(^)]i?(^)|| 2 ) 2 (using Parseval’s Theorem) 
J o 

where R(s) is the Laplace transform of r(t) and || • (,sj|| 2 denotes the standard 
H 2 norm. Thus the mathematical problem of interest here is to choose Q(s) to 
minimize ||[1 — F , (i)2(^)]^(^)|| 2 - The following theorem gives the analytical 
expression for the minimizing Q{s). The detailed derivation can be found in [1]. 

Theorem 2.1 Let P(s) be the stable plant to be controlled and let R(s) be the 
Laplace Transform of the external input signal r(t) 1 . Suppose that R(s) has no 
poles in the open right half plane 2 and that there exists at least one choice, say 
Qo(s), of the stable 1MC parameter £>(s) such that [1 — P(s)(2o(s)]-R('S) is 
stable 3 . Let z Pl , z pi , . . . , z Pl be the open right half plane zeros of P(s) and define 
the Blaschke product 4 

R , , (s + + - P2 ) ■ ■ • (~s + 2 PI ) 

P{) (s + z Pl ){s + z P2 )...( S + z Pl ) 

so that P(s) can be rewritten as 

P(s) = B p (s)P m (s) 

1 For the sake of simplicity, both P(s) and R(s') are assumed to be rational transfer 
functions. The theorem statement can be appropriately modified for the case where P(s) 
and/or R(s) contain all-pass time delay factors [1] 

2 This assumption is reasonable since otherwise the external input would be 
unbounded. 

3 The final construction of the Hi optimal controller serves as proof for the existence 
of a (2o(s) with such properties. 

4 Here ( 7 ) denotes complex conjugation. 
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where P M ( s) is minimum phase. Similarly, let z n . z r2 , . . . ,z n be the open right 
half plane zeros of R{s) and define the Blashcke product 

(•? + Z n )(.S + Z ri) ■ ■ ■ {s + z r k ) 

so that R{s) can be rewritten as 

R{s) = B r (s)R m (s) 

where Rm(s) is minimum phase. Then the Q(s) which minimizes 
|| [1 - P(j)e(j)]/?(j)|| 2 is given by 

<200 = Pm(s)Rm\s)[B- p 1 (s)R m (s)1 ( 1 . 2 ) 

where [•]* denotes that after a partial fraction expansion, the terms corre- 
sponding to the poles of B p l (s) are removed. 

Remark 2.1 The optimal Q(s) defined in (1.2) is usually improper. So it is 
customary to make Q(s) proper by introducing sufficient high frequency 
attenuation via what is called the TMC Filter’ F(s) [1]. Instead of the optimal 
Q(s) in (1.2), the Q(s) to be implemented is given by 

Q{s) = P m '(s)R m \s) [B~ p 1 (s)R m (s)]F(s) (1.3) 

where F(s) is the stable IMC filter. The design of the IMC filter for // 2 optimal 
control depends on the choice of the input R{s) . Although this design is carried 
out in a somewhat ad hoc fashion, care is taken to ensure that the original 
asymptotic tracking properties of the controller are preserved. This is because 
otherwise [1 — P{s)Q(s)]R{s) may no longer be a function in H 2 . As a specific 
example, suppose that the system is of Type l. 5 Then, a possible choice for the 
IMC filter to ensure retention of asymptotic tracking properties is 

F{s) = sr, t > 0 where n* is chosen to be a large enough positive 

(ts T 1) 

integer to make Q(s) proper. As shown in [1], the parameter r represents a 
trade-off between tracking performance and robustness to modelling errors. 



1.2.4 Hoo optimal control 

The sensitivity function S(s) and the complementary sensitivity function T(s) 
for the IMC configuration in Figure 1.1 are given by S(s) = 1 — P(s)Q(s) and 
T(s') = P(s)Q(s) respectively [1], Since the plant Pis) is open loop stable, it 
follows that the H ^ norm of the complementary sensitivity function T{s) can 



be made arbitrarily small by simply choosing Q{s) 



- and letting k tend to 
k 



5 Other system types can also be handled as in [1], 
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infinity. Thus minimizing the H x norm of T(s) does not make much sense 
since the inhmum value of zero is unattainable. 

On the other hand, if we consider the weighted sensitivity minimization 
problem where we seek to minimize || ^(s)S'(s)|| 00 for some stable, minimum 
phase, rational weighting transfer function IV (s ') , then we have an interesting 
tfoo minimization problem, i.e. choose a stable Q(s) to minimize 
||fF(,y)[l — _P(.s)Q(,s)]|| 0C .The solution to this problem depends on the number 
of open right half plane zeros of the plant P{s) and involves the use of 
Nevanlinna-Pick interpolation when the plant P(s) has more than one right 
half plane zero [7]. However, when the plant has only one right half plane zero 
b\ and none on the imaginary axis, there is only one interpolation constraint 
and the closed form solution is given by [7] 

= ('■ 4 ) 

Fortunately, this case covers a large number of process control applications 
where plants are typically modelled as minimum phase first or second order 
transfer functions with time delays. Since approximating a delay using a first 
order Pade approximation introduces one right half plane zero, the resulting 
rational approximation will satisfy the one right half plane zero assumption. 



Remark 2.2 As in the case of H 2 optimal control, the optimal £>(y) defined by 
(1.4) is usually improper. This situation can be handled as in Remark 2.1 so 
that the Q(s) to be implemented becomes 



GW 



W{b{) 

W(s) 



P~ 1 (s)F(s ) 



(1.5) 



where F(s) is a stable IMC filter. In this case, however, there is more freedom 
in the choice of F(s) since the H x optimal controller (1.4) does not necessarily 
guarantee any asymptotic tracking properties to start with. 



7.2.5 Robustness to uncertainties (small gain theorem) 

In the next section, we will be combining the above schemes with a robust 
adaptive law to obtain adaptive IMC schemes. If the above IMC schemes are 
unable to tolerate uncertainty in the case where all the plant parameters are 
known, then there is little or no hope that certainty equivalence designs based 
on them will do any better when additionally the plant parameters are 
unknown and have to be estimated using an adaptive law. Accordingly, we 
now establish the robustness of the nonadaptive IMC schemes to the presence 
of plant modelling errors. Without any loss of generality let us suppose that the 
uncertainty is of the multiplicative type, i.e. 

P(s) = Ro (■?)(! + pA,„(T)) 



( 1 . 6 ) 
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where Pq(s) is the modelled part of the plant and pA m (s) is a stable multi- 
plicative uncertainty such that Po(s)A m (s)h strictly proper. Then we can state 
the following robustness result which follows immediately from the small gain 
theorem [8]. A detailed proof can also be found in [1], 

Theorem 2.2 Suppose Pq (s) and Q(s) are stable transfer functions so that the 
IMC configuration in Figure 1.1 is stable for P(s) = Po(s). Then the IMC 
configuration with the actual plant given by (1.6) is still stable provided 

ii£ [0, p*) where p* = — — ^ — . 

IIPoCOeWM^IL 



1.3 Adaptive internal model control schemes 



In order to implement the IMC-based controllers of the last section, the plant 
must be known a priori so that the ‘internal model’ can be designed and the 
IMC parameter Q(s) calculated. When the plant itself is unknown, the IMC- 
based controllers cannot be implemented. In this case, the natural approach to 
follow is to retain the same controller structure as in Figure 1.1, with the 
internal model being adapted on-line based on some kind of parameter 
estimation mechanism, and the IMC parameter Q(s) being updated pointwise 
using one of the above control laws. This is the standard certainty equivalence 
approach of adaptive control and results in what are called adaptive internal 
model control schemes. Although such adaptive IMC schemes have been 
empirically studied inthe literature, e.g. [2, 3], our objective here is to develop 
adaptive IMC schemes with provable guarantees of stability and robustness. 

To this end, we assume that the stable plant to be controlled is described by 

P(s) = ^[l+pA m (s)] } p>0 (1.7) 



where Rp(s) is a tnonic Hurwitz polynomial of degree n; Z 0 (s) is a polynomial 

ry r / \ 

of degree / with / < «; - - represents the modelled part of the plant; and 

z < s \ 

pA m (s) is a stable multiplicative uncertainty such that — ttA m (s) is strictly 

7?o (s) 

proper. We next present the design of the robust adaptive law which is carried 
out using a standard approach from the robust adaptive control literature [9]. 



7.3. 7 Design of the robust adaptive law 
We start with the plant equation 

y = ^j\[i+^m(s)][u], p > 0 



( 1 . 8 ) 
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where u, y are the plant input and output signals. This equation can be 
rewritten as 

^o(j)M = z o (j)M + pA m (s)Z 0 (s)[u] 

Filtering both sides by where A (.s') is an arbitrary, monic, Hurwitz 

polynomial of degree «, we obtain 

(L9) 

The above equation can be rewritten as 

y = 0* T (j) + pri ( 1 . 10 ) 

where 9* = [9 \ T , Of] 7 "', 9\, 9\ are vectors containing the coefficients of 



[A(j) — A 0 (.s)] and Z 0 (s) respectively; tj> = [tf>f , 

, ai( s ) r i. 

^ 2 = T7^M. 



ii= ^5r w ' 



AW 



and 



n-i(-s) = [s n \s n 2 ,..., 1 ]' 
a,(s) = [.vV 1 ,...,!] 7 ' 



A (^)^o(^) r -I 

,,= AW “ 



(1.11) 



Equation (1.10) is exactly in the form of the linear parametric model with 
modelling error for which a large class of robust adaptive laws can be 
developed. In particular, using the gradient method with normalization and 
parameter projection, we obtain the following robust adaptive law [9] 



9 = Pr[yE(j)\, 9(0) G Cg 

y — 9 



£ = 



mr 



9 = o <t> 

nr = 1 + n~ 
m s = —8o m s + u 2 + y 2 



n;, n] = m s 



m s ( 0) = 0 



( 1 . 12 ) 

(1.13) 

(1.14) 

(1.15) 

(1.16) 



where 7 > 0 is an adaptive gain; Cg is a known compact convex set containing 
9*; Pr\-\ is the standard projection operator which guarantees that the param- 
eter estimate 9(t) does not exit the set Cg and ho > 0 is a constant chosen so that 

£ 

A m (s), are analytic in lZe[s\ > — y. This choice of 6 0 , of course, 

necessitates some a priori knowledge about the stability margin of the 
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unmodelled dynamics, an assumption which has by now become fairly 
standard in the robust adaptive control literature [9]. The robust adaptive 
IMC schemes are obtained by replacing the internal model in Figure 1.1 by 
that obtained from equation (1.14), and the IMC parameters Q(s ) by time- 
varying operators which implement the certainty equivalence versions of the 
controller structures considered in the last section. The design of these certainty 
equivalence controllers is discussed next. 

1.3.2 Certainty equivalence control laws 

We first outline the steps involved in designing a general certainty equivalence 
adaptive IMC scheme. Thereafter, additional simplifications or complexities 
that result from the use of a particular control law will be discussed. 

• Step 1: First use the parameter estimate 9(t) obtained from the robust 
adaptive law (1.1 2) ( 1.16) to generate estimates of the numerator and 
denominator polynomials for the modelled part of the plant 6 

Zo(s,t) = $2 (t)ctl(s) 

R 0 (s, t ) = AO) - e\ 0)a„_i0) 

• Step 2: Using the frozen time plant P(s,t) = calculate the appro- 

R 0 {s,t) 

priate Q(s , t ) using the results developed in Section 1.2. 

• Step 3: Express Q(s,t) as Q(s,t) = ^ where Q„(s,t) and Qd(s,t) are 

Qd(s,t) 

time- varying polynomials with Qd(s, t) being monic. 

• Step 4: Choose A] (s) to be an arbitrary monic Flurwitz polynomial of degree 
equal to that of Qd(s, t), and let this degree be denoted by nd- 

• Step 5: The certainty equivalence control law is given by 

» = gj (0 M + q n ( t ) [r - em 2 ] (1.17) 

where cjd(t) is the vector of coefficients of Ai(j) — Qd(s, t); q„{t) is the 
vector of coefficients of Q n (s,t); a nd (s) = [s" d . s nd ~\ ...,1] 7 and 
a„ d - l {s) = [^-\ s n “~ 2 , ...,l] r . 

The robust adaptive IMC scheme resulting from combining the control law 
(1.17) with the robust adaptive law (1 . 12) (1 . 16) is schematically depicted in 

6 In the rest of this chapter, the ‘hats’ denote the time varying polynomials/frozen 
time ‘transfer functions’ that result from replacing the time-invariant coefficients of a 

‘hat-free’ polynomial/transfer function by their corresponding time-varying values 
obtained from adaptation and/or certainty equivalence control. 
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Figure 1.2 Robust adaptive IMC scheme 



Figure 1.2. We now proceed to discuss the simplifications or additional 
complexities that result from the use of each of the controller structures 
presented in Section 1.2. 

1.3. 2.1 Partial adaptive pole placement 

In this case, the design of the IMC parameter does not depend on the estimated 
plant. Indeed, Q(s) is a fixed stable transfer function and not a time-varying 
operator so that we essentially recover the scheme presented in [4]. 
Consequently, this scheme admits a simpler stability analysis as in [4] although 
the general analysis procedure to be presented in the next section is also 
applicable. 

1.3. 2. 2 Model reference adaptive control 

In this case from (1.1), we see that the Q(s,t) in Step 2 of the certainty 
equivalence design becomes 

Q(s,t) = W m (s)[P(s,t)]~ l (1.18) 

Our stability analysis to be presented in the next section is based on results in 
the area of slowly time-varying systems. In order for these results to be 
applicable, it is required that the operator (?(,?, t) be pointwise stable and 
also that the degree of Qd(s, t) in Step 3 of the certainty equivalence design not 
change with time. These two requirements can be satisfied as follows: 

• The pointwise stability of £>(,?, t) can be guaranteed by ensuring that the 
frozen time estimated plant is minimum phase, i.e. Zo(s, t) is Hurwitz stable 
for every fixed t. To guarantee such a property for Zq(s, t ), the projection set 
Cg in (1.12) — (1.16) is chosen so that V 9 e Cg, the corresponding 
Z Q (s) = dja^s) is Hurwitz stable. By restricting Cg to be a subset of a 
Cartesian product of closed intervals, results from Kharitonov Theory [10] 
can be used to ensure that Cg satisfies such a requirement. Also, when the 
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projection set Cg cannot be specified as a single convex set, results from 
hysteresis switching using a finite number of convex sets [11] can be used. 
• The degree of Qd(s, t) can also be rendered time invariant by ensuring that 
the leading coefficient of Zo(s,t) is not allowed to pass through zero. This 
feature can be built into the adaptive law by assuming some knowledge 
about the sign and a lower bound on the absolute value of the leading 
coefficient of Z 0 (s). Projection techniques, appropriately utilizing this knowl- 
edge, are by now standard in the adaptive control literature [12]. 

We will therefore assume that for IMC-based model reference adaptive 
control, the set Cg has been suitably chosen to guarantee that the estimate 
6(t) obtained from (1.1 2) ( 1.16) actually satisfies both of the properties 
mentioned above. 



1.3. 2. 3 Adaptive H 2 optimal control 

In this case, Q{s , t) is obtained by substituting Pj}(s, t), B P l (s, t) into the right- 
hand side of (1.3) where Pm(s, t) is the minimum phase portion of P(s, t) and 
B P (s, t) is the Blaschke product containing the open right-half plane zeros of 
Zq(s, t). Thus Q(s, t) is given by 

Q(s,t) = P^(s,t)R^(s)[B- p l (s,t)R M (s)lF(s) (1.19) 

where [•]* denotes that after a partial fraction expansion, the terms corre- 
sponding to the poles of B p l (s, t) are removed, and F{s) is an IMC filter used 
to force Q(s, t) to be proper. As will be seen in the next section, specifically 
Lemma 4.1, the degree of Qd{s , t) in Step 3 of the certainty equivalence design 
can be kept constant using a single fixed F(s) provided the leading coefficient of 
Zq(s, t) is not allowed to pass through zero. Additionally Zfis, t) should not 
have any zeros on the imaginary axis. A parameter projection modification, as 
in the case of model reference adaptive control, can be incorporated into the 
adaptive law (1 . 12) ( 1.16) to guarantee both of these properties. 



1.3. 2. 4 Adaptive H ^ optimal control 

In this case, Q(s, t) is obtained by substituting P(s, t) into the right-hand side of 



(1.5), i.e. 



Q{s,t) 



W{h x ) 

W(s) 



R- l (s, t)F(s) 



( 1 . 20 ) 



where b\ is the open right half plane zero of Zo(s, t) and F(s') is the IMC filter. 
Since (1.20) assumes the presence of only one open right half plane zero, the 
estimated polynomial Zfis. t) must have only one open right half plane zero 
and none on the imaginary axis. Additionally the leading coefficient of Zfis, t) 
should not be allowed to pass through zero so that the degree of Qd(s, t) in Step 
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3 of the certainty equivalence design can be kept fixed using a single fixed F(s). 
Once again, both of these properties can be guaranteed by the adaptive law by 
appropriately choosing the set Cg. 

Remark 3.1 The actual construction of the sets Cg for adaptive model 
reference, adaptive H 2 and adaptive H ^ optimal control may not be 
straightforward especially for higher order plants. However, this is a well- 
known problem that arises in any certainty equivalence control scheme based 
on the estimated plant and is really not a drawback associated with the IMC 
design methodology. Although from time to time a lot of possible solutions to 
this problem have been proposed in the adaptive literature, it would be fair to 
say that, by and large, no satisfactory solution is currently available. 



1.4 Stability and robustness analysis 



Before embarking on the stability and robustness analysis for the adaptive 
IMC schemes just proposed, we first introduce some definitions [9, 4] and state 
and prove two lemmas which play a pivotal role in the subsequent analysis. 



Definition 4.1 For any signal x : [0, 00 ) — > R", x, denotes the truncation of x 
to the interval [0, t] and is defined as 



x,{t) 



x(t) if T<t 

0 otherwise 



( 1 . 21 ) 



Definition 4.2 For any signal x : [0, 00 ) — » R'\ and for any 6 > 0, t > 0, 1 1 x f 1 1 2 
is defined as 

IWl 2 =(^ e" 6 ('- T) [x 7 ’(r)x(r)] drj (1.22) 

The || ( • ) r 1 1 2 represents the exponentially weighted Li norm of the signal 
truncated to [0,/]. When <5=0 and ' = 110), II 2 becomes the usual Li 

norm and will be denoted by ||.|| 2 . It can be shown that ||.||2 satisfies the 
usual properties of the vector norm. 

Definition 4.3 Consider the signals x : [0, 00 ) — > R'\ y : [0, 00 ) — > R + and the 
set 

( rt+T rt+T 

S(y) = lx : [0, 00 ) — > R"\ J ' x r (r)x(r) dr < J y{r) dr + c 

for some c > 0 and V t. T > 0. We say that x is j-small in the mean if x € S(y). 

Lemma 4.1 In each of the adaptive IMC schemes presented in the last section, 
the degree of Qd(s, t) in Step 3 of the certainty equivalence design can be made 
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time invariant. Furthermore, for the adaptive H 2 and II y designs, this can be 
done using a single fixed F(s). 

Proof The proof of this lemma is relatively straightforward except in the case 
of adaptive H 2 optimal control. Accordingly, we first discuss the simpler cases 
before giving a detailed treatment of the more involved one. 



For adaptive partial pole placement, the time invariance of the degree of 
Qd(s, t) follows trivially from the fact that the IMC parameter in this case is 
time invariant. For model reference adaptive control, the fact that the leading 
coefficient of Zfis. t) is not allowed to pass through zero guarantees that the 
degree of Qd(s, t) is time invariant. Finally, for adaptive H x optimal control, 
the result follows from the fact that the leading coefficient of Zo(s,t) is not 
allowed to pass through zero. 

We now present the detailed proof for the case of adaptive H 2 optimal 
control. Let n r ,m r be the degrees of the denominator and numerator 
polynomials respectively of R{s). Then, in the expression for Q(s,t) in (1.19), 

Al/ , nth order polynomial , ... 

it is clear that P j (s, t) = — , — while R, (s) = 

M J /th order polynomial M 

n, th order polynomial r - , , N , ,, (n — l)th order polynomial 

m, th order polynomial L J * «th order polynomial 

where n < n r , strict inequality being attained when some of the poles of 

Rm(s) coincide with some of the stable zeros of Bp l (s, t). Moreover, in any 

case, the nth order denominator polynomial of [BjJ (.?, is a factor of 

the «, th order numerator polynomial of Rjf(s). Thus for the Q(s, t) given in 

(1.19), if we disregard F(s), then the degree of the numerator polynomial is 

n + — 1 while that of the denominator polynomial is / + m r ^n + n r — 1 . 

Hence, the degree of Q c /(s, t) in Step 3 of the certainty equivalence design can 

be kept fixed at (n + n r — 1), and this can be achieved with a single fixed F(s) of 

relative degree n — I + n r — m r — 1 , provided that the leading coefficient of 

Zq(s, t) is appropriately constrained. 



Remark 4.1 Lemma 4.1 tells us that the degree of each of the certainty 
equivalence controllers presented in the last section can be made time 
invariant. This is important because, as we will see, it makes it possible to 
carry out the analysis using standard state-space results on slowly time-varying 
systems. 

Lemma 4.2 At any fixed time t, the coefficients of Qd(s, t), Q„(s , t), and hence 
the vectors cjd(t), q n (t), are continuous functions of the estimate (fit). 



Proof Once again, the proof of this lemma is relatively straightforward except 
in the case of adaptive H 2 optimal control. Accordingly, we first discuss the 
simpler cases before giving a detailed treatment of the more involved one. 
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For the case of adaptive partial pole placement control, the continuity 
follows trivially from the fact that the IMC parameter is independent of 6{t). 
For model reference adaptive control, the continuity is immediate from (1.18) 
and the fact that the leading coefficient of Z 0 (s,t) is not allowed to pass 
through zero. Finally for adaptive H x optimal control, we note that the right 
half plane zero b\ of Zo(s,t) is a continuous function of 9(t). This is a 
consequence of the fact that the degree of Z 0 (s, t) cannot drop since its leading 
coefficient is not allowed to pass through zero. The desired continuity now 
follows from (1.20). 

We now present the detailed proof for the Hi optimal control case. Since the 
leading coefficient of Zo(s,t) has been constrained so as not to pass through 
zero then, for any fixed t, the roots of Zq(s, t) are continuous functions of 9{t). 
Flence, it follows that the coefficients of the numerator and denominator 
polynomials of[FA/(5, f)] _1 = [Bp{s 1 f)][P(s, f)] _l are continuous functions of 
9(t). Moreover, [[Bp(s, f)] _ 7 ?m(s)]* is the sum of the residues of 
[Bp(s,t)]~ l Rm(s) at the poles of Rm ( s) , which clearly depends continuously 
on 9{t) (through the factor [Bp(s, ?)] *). Since F(s) is fixed and independent of 
9, it follows from (1.19) that the coefficients of Q,r(s,t), Q„(s,t) depend 
continuously on 9(t). 

Remark 4.2 Lemma 4.2 is important because it allows one to translate slow 
variation of the estimated parameter vector 9(t) to slow variation of the 
controller parameters. Since the stability and robustness proofs of most 
adaptive schemes rely on results from the stability of slowly time-varying 
systems, establishing continuity of the controller parameters as a function of 
the estimated plant parameters (which are known to vary slowly) is a crucial 
ingredient of the analysis. 

The following theorem describes the stability and robustness properties of the 
adaptive IMC schemes presented in this chapter. 

Theorem 4.1 Consider the plant (1.8) subject to the robust adaptive IMC 
control law (1. 12)— (1. 16), (1.17), where (1.17) corresponds to any one of the 
adaptive IMC schemes considered in the last section and r(t) is a bounded 
external signal. Then, 3 fi* > 0 such that V/i€ [0, //), all the signals in the 

closed loop system are uniformly bounded and the error y — y e S ( c „ ) for 

„ n 7 ... mZ , 

some c > 0 v 7 

7 In the rest of this chapter, ‘c’ is the generic symbol for a positive constant. The exact 
value of such a constant can be determined (for a quantitative robustness result) as in 
[13, 9], However, for the qualitative presentation here, the exact values of these 
constants are not important. 
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Proof The proof is obtained by combining the properties of the robust 
adaptive law (1.12)— (1.16) with the properties of the IMC-based controller 
structure. We first analyse the properties of the adaptive law. 



From (1.10), (1.13) and (1.14), we obtain 

-B T f + pri ~ 



£ = • 



m- 



(1.23) 



Consider the positive definite function 

2j 

Then, along the solution of (1.12), it can be shown that [9] 

V < 0 T £(j) 

= e \—em 2 + p;rf\ (using (1.23) 



1 



< ~^£ 2 m 2 + - 



1 prrj 



2„2 



(completing the squares) 



(1.24) 



2 ' 2 m 2 

From (1.11), (1.15), (1.16), using Lemma 2.1 (Equation (7)) in [4], it follows 
n 

that — € Loo- Now, the parameter projection guarantees that V, 6, 9 € Loo- 
m 

Hence integrating both sides of (1.24) from t to t+ T, we obtain 

em € S ( 



Also from (1.12) 



2 2 
fir] 



< fern 



\<f>\ 



m 



(1.25) 



From the definition of <j>, it follows using Lemma 2.1 (equation (7)) in [4] that 



2 2 

— € Loo , which in turn implies that 9 e S ( c — ) . This completes the analysis 

m \ m z J 

of the properties of the robust adaptive law. To complete the stability proof, we 
now turn to the properties of the IMC-based controller structure. 

The certainty equivalence control law (1.17) can be rewritten as 



Ai(s) 



[«] + (t) 



cfld- 1 



Ai(j) 









where (3i(t), fait),- ■ ■ , P„ d (t) are the time-varying coefficients of Qd(s,t). 



Defining 



xi = 



1 5 

\U\,X2 = 



r,ltd 1 



Ai(^) 



Ai(j) 



• • 5 Xn d — 



Ai (^) 



[u),X = [x\,X 2 ,-.-,X nd ) 
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the above equation can be rewritten as 



where 



X = A{i)X + Bq T n {t) a -^[r-em 2 ] 



m= 



0 

0 



1 0 • • 0 
0 10 - 0 



-AvM -A rf -i(0 • • • -AO) 

'o' 

0 

54 . 

0 

1 



(1.26) 



Since the time- varying polynomial Qd(s, t) is pointwise Hurwitz, it follows that 
for my fixed t , the eigenvalues of A(t) are in the open left half plane. Moreover, 
since the coefficients of Qd(s, t) are continuous functions of 0(t) (Lemma 4.2) 
and 0{t) £ Co, a compact set, it follows that 3 a s > 0 such that 



5c{A,(y4(/))} < — u s V t > 0 and i = 1,2 ,nj 



The continuity of the elements of A(t) with respect to 9{t) and the fact that 

{ ■ ( uV\ 

6 £ 5 c — — — I together imply that A(t) £ 5 c , I. Hence, using the fact 
y m~ J y m- J 

that — £ Loo, it follows from Lemma 3.1 in [9] that 3 n’t > 0 such that 
m 

V ^ € [0, A), the equilibrium state x e = 0 of x = A(t)x is exponentially 
stable, i.e. there exist co,po > 0 such that the state transition matrix $(t, r) 
corresponding to the homogeneous part of (1.26) satisfies 

||4>(t,r)| < coe~ M, - T) V t > r (1.27) 



From the identity u 
rewritten as 



where 

v{t) = 



u], it is easy to see that the control input u can be 



_ A iC0 

’ A iW 

u= v T {i)X + q T n {t) and ^'" ™ 21 



Ai(j) 



[r — enr] 



(1.28) 



[A, v P ni ( 1 ) , A „ d _ x (3„ d | (t) , ■ ■ ■ , Ai A (/)] 



and 



Aj(j) = s* + A 1 j*'-‘ + ... + A* 
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Also, using (1.28) in the plant equation (1.8), we obtain 






v T (t)X + q T n (/) [r - em 1 ] 



(1.29) 



Now let 6 G (0, min[(5o ; /?o]j be chosen such that Rq(s), Ai(j) are analytic in 



lZe[s\ > — and define the hctitious normalizing signal m/{t) by 



/«/■(?) = 1.0+ IMI2 + IWI2 


(1.30) 


As in [9], we take truncated exponentially weighted norms on both sides of 
(1.28), (1.29) and make use of Lemma 3.3 in [9] and Lemma 2.1 (equation (6)) 
in [4], while observing that v(t), q„(t), r(t) G L 0 c , to obtain 


INI2 < c + c||(em 2 ),||2 


(1.31) 


IWI2 < c + c||(em 2 ) ; ||2 


(1.32) 


which together with (1.30) imply that 




m f {t) < C + C||(£7W 2 ),||2 


(1.33) 



Now squaring both sides of (1.33) we obtain 

mj(t) < c+ c f e E?m 2 mj-(r)dT (since m(t) < m/(t ) 

J o 



=>■ m 2 (t) < c + c 



e-6(t-s) £ 2 (s)m 2 {s) 




ds 



(using the Bellman-Gronwall lemma [8]) 

[p 2 rr\ r; 

Since em € S — =- and — is bounded, it follows using Lemma 2.2 in [4] that 
\ m- ) m 

3 p* G (0, p\) such that V/iG [0, /lx*), mj G L x , which in turn implies that 

(h Tj 

m G Loo. Since — are bounded, it follows that 0, ri G L^. Thus 
m m 

em 2 = —9 T (j) + pp is also bounded so that from (1.26), we obtain X G L x . 

From (1.28), (1.29), we can now conclude that u, y G L 0 c . This establishes the 

boundedness of all the closed loop signals in the adaptive IMC scheme. Since 

fp 2 p 2 \ f p 2 ri 2 \ 

y —y = em 2 and em eS — y- J , m G L x , it follows that y — y G S ( c - — y- J as 

claimed and, therefore, the proof is complete. 



Remark 4.3 The robust adaptive IMC schemes of this chapter recover the 
performance properties of the ideal case if the modelling error disappears, i.e. 
we can show that if p = 0 then y — y — > 0 as t — > oo. This can be established 
using standard arguments from the robust adaptive control literature, and is a 
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consequence of the use of parameter projection as the robustifying modifica- 
tion in the adaptive law [9]. An alternative robustifying modification which can 
guarantee a similar property is the switching-cr modification [14]. 



1.5 Simulation examples 



In this section, we present some simulation examples to demonstrate the 
efficacy of the adaptive IMC schemes proposed. 

We first consider the plant (1.7) with Zq{s) = s + 2, Rq(s) = s 2 + s + 1, 
s + 1 

A m = and /z = 0.01. Choosing <5 0 = 0.1, 7=1, A(^) = s 2 + 2s + 2, 

s “l - 3 . 

C 0 = [-5.0, 5.0] x [-4.0, 4.0] x [0.1, 6.0] x [-6.0, 6.0], Q(s) = and im- 

s 4 

plementing the adaptive partial pole placement control scheme (1 . 12)— (1 . 16), 

(1.17) , with 0(0) = [—1.0, 2.0, 3.0, 1 .0] 7 and all other initial conditions set to 
zero, we obtained the plots in Figure 1.3 for r(t) = 1.0 and r(t) = sin (0. 2/) . 

5 + 2 

From these plots, it is clear that y(t) tracks — — hj quite well. 

w (s 2 + s+ 1)0 + 4) L J s 

Let us now consider the design of an adaptive model reference control 
scheme for the same plant where the reference model is given by 

W m (s) = , ] . The adaptive law (1.12)— (1.16) must now guarantee 

that the estimated plant is pointwise minimum phase, to ensure which, we 
now choose the set Cg as Cg = [-5.0, 5.0] x [—4.0, 4.0] x [0.1, 6.0] x [0.1, 6.0]. 
All the other design parameters are exactly the same as before except that now 

(1.17) implements the IMC control law (1.18) and Ai(^) = 5 3 + 2s 1 + 2s + 2. 
The resulting plots are shown in Figure 1.4 for r(t) = 1.0 and r(t) = sin(0.2f). 
From these plots, it is clear that the adaptive IMC scheme does achieve model 
following. 

The modelled part of the plant we have considered so far is minimum phase 
which would not lead to an interesting H 2 or H x optimal control problem. 
Thus, for H 2 and H x optimal control, we consider the plant (1.7) with 

s \ 

Z 0 (s) = —s+ 1, Rq(s) = s 2 + 3s + 2, A„, = ^ and /z = 0.0 1 . Choosing 



(>0 = 0.1, 7=1, A(^) = s 2 + 2s + 2, Ai (5) = 
[-4.0, 4.0] x [-6.0, -0.1] x [-6.0, 6.0], F(s) 



5 2 + 25 + 2, Cg = [-5.0, 5.0] x 
= — =■ and implementing the 

(*+l) 2 



adaptive H 2 optimal control scheme (1. 12)— (1. 16), (1.1 7), with 

0(0) = [2.0, 2.0, —2.0, 2.0 ] T and all other initial conditions set to zero, we 



obtained the plot shown in Figure 1.5. From Figure 1.5, it is clear that y(t) 
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time t (second) 



asymptotically tracks r{t) quite well. Note that the projection set Cg here has 
been chosen to ensure that the degree of Zq(s, t) does not drop. 

Finally, we simulated an H x optimal controller for the same plant used for 

the H 2 design. The weighting W(s) was chosen as W(s) = ^ and the set 

Cg was taken as Cg = [-5.0, 5.0] x [—4.0, 4.0] x [-6.0, —0.1] x [0.1, 6.0]. This 
choice of Cg ensures that the estimated plant has one and only one right half 
plane zero. Keeping all the other design parameters the same as in the H 2 
optimal control case and choosing r(t) = 1.0 and r(t) = 0.8 sin(0.2t), we 
obtained the plots shown in Figure 1.6. From these plots, we see that the 
adaptive //oo-optimal controller does produce reasonably good tracking. 



1.6 Concluding remarks 

In this chapter, we have presented a general systematic theory for the design 
and analysis of robust adaptive internal model control schemes. The certainty 
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time t (second) time t (second) 

Figure 1.4 MR AC !MC simulation 




Figure 1.5 Hi IMC simulation 



time t (second) 
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time t (second) 
Figure 1.6 H^, /MC simulation 




equivalence approach of adaptive control was used to combine a robust 
adaptive law with robust internal model controller structures to obtain 
adaptive internal model control schemes with provable guarantees of robust- 
ness. Some specific adaptive IMC schemes that were considered here include 
those of the partial pole placement, model reference, H 2 optimal and H x 
optimal control types. A single analysis procedure encompassing all of these 
schemes was presented. 

We do believe that the results of this chapter complete our earlier work on 
adaptive IMC [4, 5] in the sense that a proper bridge has now been established 
between adaptive control theory and some of its industrial applications. It is 
our hope that both adaptive control theorists as well as industrial practitioners 
will derive some benefit by traversing this bridge. 
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An algorithm for robust 
adaptive control with less 
prior knowledge 

G. Feng, Y. A. Jiang and R. Zmood 



Abstract 

A new robust discrete-time singularity free direct adaptive control scheme is 
proposed with respect to a class of modelling uncertainties in this chapter. Two 
key features of this scheme are that a relative dead zone is used but no 
knowledge of the parameters of the upper bounding function on the class of 
modelling uncertainties is required, and no knowledge of the lower bound on 
the leading coefficient of the parameter vector is required to ensure the control 
law singularity free. Global stability and convergence results of the scheme are 
provided. 



2.1 Introduction 

Since it was shown (e.g. [1], [2]) that unmodelled dynamics or even a small 
bounded disturbance could cause most of the adaptive control algorithms to go 
unstable, much effort has been devoted to developing robust adaptive control 
algorithms to account for the system uncertainties. As a consequence, a 
number of adaptive control algorithms have been developed, for example, 
see [3] and references therein. Among those algorithms are simple projection 
(e.g. [4], [5]), normalization (e.g. [6], [7]), dead zone (e.g. [8-12]), adaptive law 
modification (e.g. [13], [14]), er-modification (e.g. [15], [16]), as well as persistent 
excitation (e.g. [17], [18]). 

In the case of the dead zone based methods, a fixed dead zone can be used 
[6-8] in the presence of only bounded disturbance, which turns off the 
algorithm when the identification error is smaller than a certain threshold. In 
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order to choose an appropriate size of the dead zone, an upper bound on the 
disturbance must be known. 

When unmodelled dynamics are present, a relative dead zone modification 
should be employed [11], [12]. Here the knowledge of the parameters of 
bounding function on the unmodelled dynamics and bounded disturbances is 
required. 

However, such knowledge, especially knowledge of the nonconservative 
upper bound or the parameters of the upper bounding function, can be 
hardly obtained in practice. Therefore, the robust adaptive control algorithm 
which does not rely on such knowledge is in demand but remains absent in the 
literature. One may argue that the robustness of the adaptive control 
algorithms can be achieved with only simple projection techniques in param- 
eter estimation [4], [5]. However, it should be noted that using the robust 
adaptive control algorithms such as the dead zone, the robustness of the 
resulting adaptive control systems will be improved in the sense that the 
tolerable unmodelled dynamics can be enlarged [19]. Therefore, discussion of 
the robust adaptive control approaches such as those based on the dead zone 
technique is still of interest and the topic of this chapter. 

Another potential problem associated with adaptive control is its control law 
singularity. The estimated plant model could be in such a form that the pole- 
zero cancellations occur or the leading coefficient of the estimated parameter 
vector is zero. In such cases, the control law becomes singular and thus cannot 
be implemented. In order to secure the adaptive control law singularity free, 
various approaches have been developed. These approaches can be classified 
into two categories. One relies on persistent excitation. The other depends on 
modifications of the parameter estimation schemes. 

In the latter case, the most popular method is to hypothesize the existence of 
a known convex region in which no pole-zero cancellations occur and then to 
develop a convergent adaptive control scheme by constraining the parameter 
estimates inside this region (e.g. [11], [20-22]) for pole placement design; or to 
hypothesize the existence of a known lower bound on the leading coefficient of 
the parameter vector and then to use an ad hoc projection procedure to secure 
the estimated leading coefficient bounded away from zero and thus achieve the 
convergence and stability of the direct adaptive control system. However, such 
methods suffer the problem of requirement for significant a priori knowledge 
about the plant. 

Recently, another approach has been developed which also modifies the 
parameter estimation algorithm. This approach is to re-express the plant model 
in a special input-output representation and then use a correction procedure in 
the estimation algorithm to secure the controllability and observability of the 
estimated model of the system [23-24]. They also addressed the robustness 
problem of such algorithms with respect to bounded disturbance [25] using the 
dead zone technique. They did not address the robustness problem with respect 
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to unmodelled dynamics. Moreover, those algorithms also suffer the same 
problem as the usual dead zone based robust adaptive control algorithms. That 
is, they still require the knowledge of the upper bound on the disturbance or 
the parameters of the upper bounding function on the unmodelled dynamics 
and disturbances. 

In this chapter, a new robust direct adaptive control algorithm will be 
proposed which does use dead zone but does not require the knowledge of the 
parameters of the upper bounding functions on the unmodelled dynamics and 
the disturbance. It has also been shown that our algorithm can be combined 
with the parameter estimate correction procedure, which was originated in [24] 
to ensure the control law singularity free, so that the least a priori information 
is required on the plant. 

The chapter is organized as follows. The problem is formulated in Section 
2.2. Ordinary discrete time direct adaptive control algorithm with dead zone is 
reviewed in Section 2.3. Our main results, a new robust direct adaptive control 
algorithm and its improved version with control law singularity free are 
presented in Section 2.4 and Section 2.5 respectively. Section 2.6 presents 
one simulation examples to illustrate the proposed adaptive control algo- 
rithms, which is followed by some concluding remarks in Section 2.7. 



2.2 Problem formulation 

Consider a discrete time single input single output plant 

z~ d B(z~ i ) 

y(t)= A(z . 1} J u(t) + v(t) (2.i) 

where y(t) and u(t) are plant output and input respectively, v{t) represents the 
class of unmodelled dynamics and bounded disturbances and d is the time 
delay. A(z~ l ) and B(z~ x ) are polynomials in z~ l , written as 

A(z~ l ) = 1 + ci\z~ x + . . . + a„ z~" 

B(z~ l ) = b\z~ x + biz ~ 2 + . . . + b m z~ m 

Specify a reference model as 

E{z~ l )y * (t) = z~ d R(z~ l )r{i) 

where E(z~ l ) is a strictly stable monic polynomial written as 
E(z *) = 1 + e\z * + ... + enz 



( 2 . 2 ) 
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Then, there exist unique polynomials F(r _1 ) and G{z~ x ) written as 

F(z~ l ) = 1 +/i z- 1 + . . . +fd-\-~ d+X 

G(z~ l ) = g 0 +giz~ l + . . . +g n -iz~ n+x 
such that , , , , 

E(z~ l ) = F(z- l )A(z- l ) + z- d G(z~ l ) (2.3) 

Using equation (2.3), it can be shown that the plant equation (2.1) can be 
rewritten as 

y(t + d) = a(z -1 )y(?) + /3(z~ l )u(t) 

= e T 0(t) + V (t + d) (2.4) 

where 

y(t + d) — E(z~ l )y(t + d) 
ri(t + d) = F{z~ x )A{z~ l )v{t + d) 



4>(t) T = [u(t),u(t - 1 ),. . . ,u(t — m - d + \),y{t),y(t - 1), . . . ,y(t -n + 1)] 
:= MtU'itf] 

e T = [i 9 l ,...e n+m+d } ■- [e u e' T ] 

a(z~ l ) = G{ z~ l ) 
p(z~ l ) = F(z-')B(z~') 



We make the following standard assumptions [26], [18]. 

(Al) The time delay d and the plant order n are known. 
(A2) The plant is minimum phase. 

For the modelling uncertainties, we assume only: 

(A3) There exists a function [11] 7 (?) such that 

W0I 2 <7(0 



where 7 (t) satisfies 

7 (0 < £1 sup ||x(t)|| 2 +£ 2 

0<T<? 



for some unknown constants £1 > 0, £2 > 0, and x(t) is defined as 

x(t) = [y{t — 1), . . . ,y{t — ri),u(t — 1 u(t — m — d)] T 

For the usual direct adaptive control, in order to facilitate the implementa- 
tion of projection procedure to secure the control law singularity free, the 
following assumption is required. 
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(A4) There is a known constant 6 ] m satisfying 

l<l < 1^1 and ey>0 

For the usual relative dead zone based direct adaptive control algorithm, 
another assumption is needed as follows: 

(A5) The constants eiand £2 in (A3) are known a priori. 

Remark 2.1 It should be noted that the assumptions (A4) and (A5) will not be 
required in our new adaptive control algorithm to be developed in the next few 
sections. It is believed that the elimination of assumptions (A4) and (A5) will 
improve the applicability of the adaptive control systems. 



2.3 Ordinary direct adaptive control with dead zone 



Let 9(t) denote the estimate of the unknown parameter 8 for the plant model 
(2.4). Defining the estimation error as 



and a dead zone function as 



m - 


1 


(2.5) 


e~g 


if e > g 




0 


if M < g 0 < g < 00 


(2.6) 


e + g 


if e < -g 





then the following least squares algorithm with a relative dead zone can be used 
for parameter estimation 

P{t - - d) 



0{l) = proj{d(? - 1) + a(t) 



1 + cj)(t — d) 1 P(t — !)</>(? — d) 



e(t)} 



P(t) = P(t- 1 )-a(t) 



P{t - \)<j>{t - d)(j>(t - d) T P(t - 1) 



\ + cj)(t ~ d) T P{t - - d) 

P{-\) = k 0 I, A’o > 0 

where the term a(t) is a dead zone, which is defined as follows: 

a (A = { 0 *f l e M| 2 — £r(0 

l a /(£ 1 ' /2 7( ? ) 1 ' /2 j e M)/ e ( ? ) otherwise 



(2.7) 



( 2 . 8 ) 



with 0 < a < 1, = 



Co 

1 — a 



£0 > 1, and proj is the projection operator [26] 
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such that 



e\t) 



6\t) if 6\i) sgn (9j n ) > \9) n \ 
9 l m otherwise 



(2.9) 



It has been shown that the above parameter estimation algorithm has the 
following properties: 



(i) 9(t) is bounded 



(ii) 

(hi) 



(iv) 



\0 l (t) > \9) n \ and 



1 

W) 



9 l 

W) 



ne /2 i(t) i/2 ,e(t )) 2 

1 + (j>(t - d) T P(t - 1 )(j>{t-d) 

.m 1/2 7(0 1/2 , £ (0) 2 

1 +<X? -d) T P{t -!)(/-(? -d) 



1 

¥ 



where 

e(t) = y(t ) — 9(t — d) T cj){t — d ) 
The direct adaptive control law can be written as 

u{t)= rA<)-m T nt) 

m 

where 

r f(t) = R ( z ) r (t) 



( 2 . 10 ) 



( 2 . 11 ) 

( 2 . 12 ) 



with r(t') a reference input and R(z~ l ) is specified by the reference model 
equation (2.2). 

Then with the parameter estimation properties (i)-(iv), the global stability 
and convergence results of the adaptive control systems can be established as in 
[26], [11], which are summarized in the following theorem. 



Theorem 3.1 The direct adaptive control system satisfying assumptions (A 1 ) - 
(A5) with adaptive controller described in equations (2.7)-(2.9) and (2.11) is 
globally stable in the sense that all the signals in the loop remain bounded. 

However, as discussed in the first section, the requirement for knowledge of 
the parameters of the upper bounding function on the unmodelled dynamics 
and bounded disturbances is very restrictive. In the next section, we are 
attempting to propose a new approach to get rid of such a requirement. 



2.4 New robust direct adaptive control 

Here we develop a new robust adaptive control algorithm which does not need 
such knowledge. That is, we drop the assumption (A5). 
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The key idea is to use an adaptation law to update those parameters. The 
new parameter estimation algorithm is the same as equations (2. 7-2. 9) but with 
a different dead zone a(t), where 



if |e(0f<£(7(0 + G(0) 



a/(£ 1/2 (7(0 + Q(t)) 1/2 ,e(t))/e{t) otherwise 




/3 > 0 (2.16) 

where 

C{t) T = [ei h] 

with zero initial condition. It should be noted that i\ and i 2 will be always 
positive and non decreasing. 

As shown in [23], the projection operation does not alter the convergence 
properties of the original parameter estimation algorithms. Therefore, in the 
following analysis, the projection operation will be neglected. 

The properties of the above modified least squares parameter estimator with 
a relative dead zone are summarized in the following lemma. 

Lemma 4.1 The least squares algorithm with equations (2.7), (2.9), (2.13)- 
(2.16) applied to any system has the following properties irrespective of the 
control law: 

(i) 9(t) is bounded 

(ii) C(t) is bounded and non-decreasing, and thus £\ converges to a constant, 
say e\ 

(iii) \9 x {t) > \9) n \ and = 

9 yt) 



9 l 1 
¥(f) 9 r 



(iv) \\§(t) - 9(t - l)\\ e l 2 
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(v)/(0 2 : 



M 

1 + <j>{t - d) T P{t -!)</>(? -d) 



ne /2 m+Q(t)) i/2 ,e(t)) 2 c/ 

1+ <j)(t — d) 7 P(t — \)4>{t —d) 



Proof Define a Lyapunov function candidate 

V(t+l) =\{0{t) T P{t- \y l 6{i) + C{t+ l) r /3 _ 1 C(t + 1)) (2.17) 

where 9(t) = 9(t) — 9*, C(t+ 1) = C(t+ 1) — [e\ £ 2 ]- Then, its difference 

becomes 






ct(t) 

1 + (f>(t - d) T P(t - \)<t>{t-d) 

X ' l+f{t-d) T P{t-mt^d) 2 2 

. 1 + (1 — a(t))(j)(t — d) 1 P(t — 1 )(j>(t — d) 



+ a{i)C{f) T 

(1 - a)(l + <j>{t - d) T P{t - 1 - d)) 



sup ||.\'(r)|| 2 

0 <T<t 

1 



H a(t) 2 P 

4(1 - a) 2 (l + (j>{t - d) T P(t - \)f{t - d)) 2 





sup ||.\-(t)|| 2 


T 


sup ||x(r)|| 2 


X 


0 <T<t 




0<T<f 




1 




1 



a{t) 

1 +f(t -d) T P{t- l)<j>(t-d) 

1 + fit - d) T P(t - - d) 

. 1 + (1 — a(t))<j)(t — d) T P(t — 1 )<j>(t — d) ^ 

4 AA (C(t) 1 

(1 - a)(l + fit - d) T P{t -1)0(/- d)) 



X 



sup ||.\-(t)|| 2 

0 <T<t 



+ 2 ( 0 ) 



1 




< 



a(t) 

1 +</>(t- d) T P(t - 1 )<j>(t - d) L (1 - a) 
+ a(0 

(l-a)(l + - d) T P{t -\)(f>{t - d)) 



1 h{t) 2 - e{t ) 2 



( 7(0 - 7(0 + 2 ( 0 ) 
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< 



ci{i) 

1 + <j>{t - d) T P(t - - d) 



1 v(t) 1 - e{tf + 1 (7(0 - 7(0 + 8(0) 



(1 - a) 



(1 - a) 



< 



d{t) 



1 + <p(t - d) 1 P(t - \)<j>(t-d) 



- e W 2 + (j4^)WW + 2(0) 



< 



d(t) 



1 + <p(t-dyP(t- \)<j>(t-d) 



- e(t) + 



2 , 11 fA 2 

a ^ir (,) 



< 



d(t) 

1 + <f>(t - d) T P(t - 1 )<j>(t-d) 



~ e (tf + 



< e(t) 2 



< - 



Co-1 



/(0 



Co l + (j>{t - df P{t - \)<!>{t-d) 



(2.18) 



where the fact that o(t)e(t) 2 = f(t)e(t) >f(t) 2 has been used. Therefore, 
following the same arguments in [20], [21], [23], the results in Lemma 4.1 are 
thus proved. 



If using the same adaptive control law as in equation (2.11), then with the 
parameter estimation properties (i)-(v) in Lemma 4.1, the global stability and 
convergence results of the new adaptive control system can be established as in 
[26], [11] as long as the estimated e\ is small enough, which are summarized in 
the following theorem. 



Theorem 4.1 The direct adaptive control system satisfying assumptions (Al)- 
(A4) with the adaptive controller described in equations (2.7), (2.9), (2.13)- 
(2.16) and (2.11) is globally stable in the sense that all the signals in the loop 
remain bounded. 

In this approach, we have eliminated the requirement for the knowledge of 
the parameters of the upper bounding function on the modelling uncertainties. 
But the requirement for the knowledge of the lower bound on the leading 
coefficient of the parameter vector, i.e. assumption (A4) is still there. In the 
next section, the technique of the parameter correction procedure will be 
combined with the algorithm developed in this section to ensure the least prior 
knowledge on the plant. That is, only assumptions (A1)-(A3) are needed. 
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2.5 Robust adaptive control with least prior knowledge 



The following modified least squares algorithm will be used for robust 
parameter estimation: 



6(i) = 9(t - \) + a(t) 
P(t) = P(t - 1 ) - a{t) 



P{t-\)<!>(t-d) 

1 + <f>(t - d) T P{t - - d) [ 

P(t - !)(/>(? - d)(j>(t - d) T P{t - 1) 
1 + c/){t - d) T P{t - l)0(r -d) 



P{-l) = k 0 I , ko > 0 



(2.19) 



and the parameter estimate is then corrected [24] as 

0(0 = 0(f) + P(t)0(t) (2.20) 

where 

e( t ) = y(t) - 0(t - \) T (j>(t-d) (2.21) 

the vector (3{t) is described in Figure 2. 1 




where p(t) is the first column of the covariance matrix P{t), and the term a(t) is 
now defined as follows: 

0 if e(t)| 2 <£(7(0 + 2(0) 

a/(£ 1/2 (7(0 + Q(t)f 2 ,e(t))/e(t) otherwise 
2£o 

with 0 < a < 1, £ = , £o > 1, and 

1 — a 

Q(t) = [p(t~i) T p(t-mt-d )} 2 

a\ 

2{\^ a){\+ (t>{t - d) T P{t -\)(t>(t -d) 



sup ||x(r)|| 


I 


sup ||.\'(r)||- 


0<T</ 




0<T<? 


1 




1 
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And 7(0 is calculated by 



7 0) 



C(t) T 



sup ||x(r)|| 2 

0 <T<t 



c(0 = c(t 



1 J 

a(t) A 

(1 - a)(l + 0(f - </) r P(f - !)<£(* - rf) 



( 2 . 22 ) 

sup ||x(r)|| 2 

0 <T<t 

5 

1 

A > 0 (2.23) 



where 

C{t) T = [£1 £2] 



with zero initial condition. It should be noted that e\ and £ 2 will be always 
positive and non-decreasing. 

Remark 5.1 The prediction error e(t) is used in the modified least squares 
algorithm to ensure that the estimator property (iii) in the following lemma can 
be established. 



The properties of the above modified least squares parameter estimator are 
summarized in the following lemma. 

Lemma 5.1 If the plant satisfies the assumptions (A1)-(A3), the least squares 
algorithm (2. 1 9) (2.23) has the following properties: 

(i) 9(t) is bounded, and ||0(r) — 9{t — 1)|| € h- 

(ii) C(t) is bounded and non-decreasing, thus converges 

(iii) nmm+Qior.mf Gh 

1 + cf>(t — d) 1 P(t — 1 )cf>{t — d ) 

(iv) ||p(t)|| + |0'(?)| > b m in where 

b • = 1^1 

mn max(l, ||/3*||) 

with (3* dehned such that 

9* = 9(t) = P{t)p* 

(v) \9 l (t)\ > 

3 + £ 

(vi) 6(t) is bounded, and ||0(t) — 9(t — 1)|| € / 2 
Proof Define a Lyapunov function candidate 

V(t+ 1) = i (9(t) r P(t)~ l 9(t) + C(t+ l) r A ~'C(t+ 1)) (2.24) 
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where 6{t) = 9{t) — 0*, C(t+ 1) = C(t+ 1) — [ei e 2 ] r . Noting that 
e(t) = y(t) — 9(t — 1 ) T (j>(t — d ) 

= y(t) - 9{t - 1 -d)- 0(t - 1 ) T P{t - \)<j>(t - d) 

= e{t) - 0(t - 1 ) T P{t - 1 ) 0 (/ - d) 

Then, the difference of the Lyapunov function candidate becomes 



V(t+1)-V(t) = 



1 + <j>(t - d) T P(t - l)(j>(t-d) 



1 + 0(r -d) T P(t- l)<j>(t-d) 

1 + (1 — a(t))cj)(t — d) 1 P(t — !)</>(? — d) 



x ( rj(t ) — /3(t — 1 ) T P(t — 1 )(j>(t — d)) 2 — e(t)' 



a(t)C(t) T 

(1 - a)(l + (f>{t - d) T P{t - l)0(r - d)) 



sup ||x(r) 

0 <T<t 



a(t) 2 A 

(1 - a) 2 (l + (j){t - d) T P(t - 1 )</»(? - d)) 2 



sup ||.v(r)|| 2 l T [ sup ||.t(t )|| 2 



1 + cj)(t - df P(t - 1 )4>(t-d) 

X ~ Pit - 1 ) T P(t ~ 1 m ~ d)) 2 - e(t) 2 

+ MO (t(0 -7(0) 

(1 - a)(\ + <j){t - d) T P{t - \)(j>{t-d)) 

a(t ) 2 A 

(1 - a) 2 (l + </»(? - d) T P{t - !)</>(? - ^)) 2 



sup ||x(r)|| 2 l T \ sup ||x(t )|| 2 
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d(t) 

1 + 4>{t - df P(t - 1 )<l>(t-d) 

X W^vitf -T^—m - l) r P(r - mt - d )f - e(tf 
l — a 1 — a 



2a(Q(7(0 ~7(0) 

(1 - a)(l + </>(t - d) T P(t - l)(f)(t - d)) 



d(t ) 2 A 


CN 

_K_ 

^ VI 


T 


’ sup ||.v(r)|| 2 ' 

0 <T<t 


(l-a) 2 (l+ </>(?- d) T P{t -\)(t>{t - d )) 2 


1 




1 



< 



d{t) 

1 + (j){t - d) T P{t - !)</>(? - d) 



^ r?(0 2 _ _ W 2 +2 _ + Q{t)) 
1 — a 1 — a 



< 



d(t) 



i + &(t- dyp(t- \)4>{t-d) L 



~e(t) 2 + - ( 7(0 + 0 ( 0 ) 

1 — a 



< 



a(t) 



1 + (j)(t — dy P(t — 1 )4>(t — d) 



— e (t) 2 + T~ZP e (t) 2 
1 — a g 



< 



d(t) 

1 + <j){t - d) T P(t - l)4>(t-d) L 



~ e (t) 2 +^ e (tf 



go 



go - 1 a(Qe(Q 7 

go \ + cj>(t - d) T P{t ~ - d) 



go-i /(g 1/2 (7(0 + g(0) 1/2 ,g(0) 2 
go 1 + </>(? -d) T P{t~ \)(j>{t~d) 



(2.26) 



where the fact that o(t)e(t) 2 = f(t)e(t) >f(t) 2 has been used. Therefore, 
following the same arguments in [26], [11], [21], the results (i)-(iii) in Lemma 
5.1 are thus proved. The properties (iv)-(vi) in the lemma can also be obtained 
directly from the results in [23]. 

If using the same adaptive control law as in equation (2.11), then with the 
parameter estimation properties (i)-(vi), the global stability and convergence 
results of the new adaptive control system can be established as in [26], [11] as 
long as the estimated E\ is small enough, which are summarized in the 
following theorem. 
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Theorem 5.1 The direct adaptive control system satisfying assumptions (Al)~ 
(A3) with the adaptive controller described in equations (2. 19) (2.23) and 
(2.11) is globally stable in the sense that all the signals in the loop remain 
bounded. 



2.6 Simulation example 



In this section, one numerical example is presented to demonstrate the 
performance of the proposed algorithm. A fourth order plant is given by the 
transfer function as 

G(s) = G n (s) G u (s) 



with 



as a nominal part, and 



G n (s) 



5(s + 2) 
s(s+ 1) 



G u (s) 



229 

?T3(bT229 



as the unmodelled dynamics. 

With the sampling period T = 0.1 second, we have the following corre- 
sponding discrete-time model 



G(cf l ) 



0.09784 q-' + 0. 1206c/- 2 - 0.1414f 3 - 0.01878,? 4 
1 - 2.3422c/-' + 1.0788c/- 2 - 0.4906c/- 3 + 0.04505<r 4 



The reference model is chosen as 



Gm(s) 



1 

0.2s + 1 



whose corresponding discrete-time model is 



G m (q~ 1 ) 



0.3935c/- 1 
1 - 0.6065?-' 



We have chosen ko = 1, and 9(0) = [0.6 0 0 0] 7 . If no dead zone is used, 
the simulation results are divergent. If using the algorithm developed in this 
chapter with A = lO -5 , the simulation results are shown as in Figure 2.2, where 
(a) represents the system output v(t) and reference model output y*(t), (b) is 
the control signal u(t), (c) is the estimated parameter 0', and (d) denotes the 
estimated bounding parameters e\ and ej. 

In order to demonstrate the effect of the update rate parameter A, the 
following simulation with A= 1.4 x lO -5 was also conducted. The result is 
shown in Figure 2.3. 

The steady state values of the several important parameters and the tracking 
error in both cases are summarized in Table 2.1. 




Estimated thetal Output and reference <S Estimated thetal 
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Time in seconds Time in seconds 



2.2 Robust adaptive control with A = 10 5 




Time in seconds Time in seconds 

Figure 2.3 Robust adaptive control with A = 1.4 x 10~ 5 
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Table 2.1 Steady state values 





x = 10- 5 


A = 1.4 x 10- 5 


ii 


1.1898 x 10- 3 


1.4682 x 10- 3 


£ 2 


0.3509 x 10- 3 


0.4467 x 10- 3 


0 1 


0.5649 


0.5833 


\y-y*\ 


0.0179 


0.07587 



It can be observed from the above simulation results that the algorithm 
developed in this chapter can guarantee the stability of the adaptive system in 
the presence of the modelling uncertainties, and the smaller tracking error 
could be achieved with smaller update rate parameter A. 

Most importantly, the knowledge of the parameters e\ and £2 of the upper 
bounding function and the knowledge of the leading coefficient of the param- 
eter vector 9 X are not required a priori. 



2.7 Conclusions 

In this chapter, a new robust discrete-time direct adaptive control algorithm is 
proposed with respect to a class of unmodelled dynamics and bounded 
disturbances. Dead zone is indeed used but no knowledge of the parameters 
of the upper bounding function on the unmodelled dynamics and disturbances 
is required a priori. Another feature of the algorithm is that a correction 
procedure is employed in the least squares estimation algorithm so that no 
knowledge of the lower bound on the leading coefficient of the plant numerator 
polynomial is required to achieve the singularity free adaptive control law. The 
global stability and convergence results of the algorithm are established. 
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Adaptive variable structure 

control 

C.-J. Chien and L.-C. Fu 



3.1 Introduction 

In the past two decades, model reference adaptive control (MRAC) using only 
input/output measurements has evolved as one of the most soundly developed 
adaptive control techniques. Not only has the stability property been rigor- 
ously established [17], [19] but also the robustness issue due to unmodelled 
dynamics and input/output disturbance has been successfully solved [15], [18]. 
However, several limitations on MRAC remain to be relaxed, especially the 
problem of unpredictable transient response and tracking performance which 
has recently become one of the challenging research topics in the field of 
MRAC. A considerable amount of effort has been made to improve these 
schemes to obtain better control effects [6], [9], [11], [22]. One effort out of 
several is to try to incorporate the variable structure design (VSD) [9], [11] 
concept into the traditional model reference adaptive controller structure. 
Notably, Hsu and Costa [11] have first successfully proposed a plausible 
scheme in this line, which was then followed by a series of more general results 
[12], [13], [14], Aside from those, Fu [9], [10] has taken up a different approach 
in placing the variable structure design in the overall resulting adaptive 
controller. An offspring of the work [9] and part of the work [12] include 
various versions of results respectively applied to SISO [20], [23], MIMO [2], 
[5], time-varying [4], decentralized [24] and affine nonlinear [3] systems. 

It is well known that a main difficulty for the design of the variable structure 
MRAC system is the so-called general case when relative degree of the plant is 
greater than one. In this chapter, we present a new algorithm to solve the 
variable structure model reference adaptive control for a single input single 
output system with unmodelled dynamics and output disturbances. The design 
concept will be first introduced for relative degree-one plants and then be 
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extended to the general case. Compared with the previous works, which used 
adaptive variable structure design or traditional robust adaptive approaches 
for the MRAC problem, this algorithm has the following special features: 

( 1 ) This control algorithm successfully applies the variable structure adaptive 
controller for the general case under robustness consideration. 

(2) The control strategy using the concept of ‘average control’ rather than that 
of ‘equivalent control’ is thoroughly analysed. 

(3) A systematic design approach is proposed and a new adaptation mechan- 
ism is developed so that the prior upper bounds on some appropriately 
defined but unavailable system parameters are not needed. It is shown that 
without any persistent excitation the global stability and robustness with 
asymptotic tracking performance can be guaranteed. The output tracking 
error can be driven to zero for relative degree-one plants and to a small 
residual set (whose size depends on the level of magnitude of some design 
parameter) for plants with any higher relative degree. Both results are 
achieved even when the unmodelled dynamic and output disturbance are 
present. 

(4) If the aforementioned bounds on the system parameters are available by 
some means before controller design, then with a suitable choice of initial 
control parameters, the output tracking error can even be driven to zero in 
finite time for relative degree-one plants and to a small residual set 
exponentially for plants with any higher relative degree. It is noted that 
these bounds are usually assumed to be known before the construction of 
the variable structure controller or the robust adaptation law. 

In order to make a comparison between the proposed adaptive variable 
structure scheme and the traditional approaches, some computer simulations 
are made to illustrate the differences of the tracking performance. The 
simulations will clearly demonstrate the excellent transient responses as well 
as tracking performance, which are almost never possible to achieve when 
traditional MRAC schemes are employed [19]. 

The theoretical framework in this chapter is developed based on Filippov’s 
solution concept for a differential equation with discontinuous right-hand side 
[8], In the subsequent discussions, the following notations will be used: 

(1) /’(s) [«](?): denotes the filtered version of u{t) with any proper or strictly 
proper transfer function P{s ) . 

(2) | • |: denotes the absolute value of any scalar or the Euclidean norm of any 
vector or matrix. 

(3) II (")/ lloo = sup r <, |(-)(t)|: denotes the truncated L <*, norm of the argument 
function or vector. 

(4) || J P(«)|| 00 : denotes the H x norm of the transfer function P(s) . 

The chapter is organized as follows. In Section 3.2, we give the plant 
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description, control objective and then derive the MRAC based error model. In 
Section 3.3, the adaptive variable structure controller for relative degree-one 
plants is proposed with stability and performance analysis. The extension to 
plants with relative degree greater than one is presented in Section 3.4. Section 
3.5 gives simulation results to demonstrate the effectiveness of the adaptive 
variable structure controller. Finally, a conclusion is made in Section 3.6. 



3.2 Problem formulation 

3.2 . 7 Plant description and control objective 

In this chapter, we consider the following SISO linear time-invariant plant 
described by the equation: 

y P (t) = P(s)(l + pP u (s))[u p ](t) + d 0 (t) (3.1) 

where u p ( t) and y p (t) are plant input and plant output respectively, pP u (s) is 
the multiplicative unmodelled dynamics with some p € R + , and d 0 is the 
output disturbance. Here, P(s) represents the strictly proper rational transfer 
function of the nominal plant which is described by 

p ^ = k >W) <3 ' 2) 

where n p (s) and d p (s) are some monic coprime polynomials and k p is the high 
frequency gain. Now suppose that the plant (3.1) is not precisely known but 
some prior knowledge about the transfer function may be available. The 
control objective is to design an adaptive variable structure control scheme 
such that the output y p {t) of the plant will track the output y m (t) of a linear 
time-invariant reference model described by 

y m {t) = M{s)[r m \{t) = kj^\[r m ]{t) (3.3) 

where M(s) is a stable transfer function and r m (t) is a uniformly bounded 
reference input. In order to achieve such an objective, we need some 
assumptions on the modelled part of the plant and the reference model as 
well as the unmodelled part of the plant. These assumptions are made in the 
following. 

For the modelled part of the plant and reference model: 

(Al) All the coefficients of n p (s) and d p (s) are unknown a priori, but the order 
of P(s) and its relative degree are known to be n and p, respectively. 
Without loss of generality, we will assume that the order of M(s) and its 
relative degree are also n and p , respectively. 
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(A2) The value of high frequency gain k p is unknown, but its sign should be 
known. Without loss of generality, we will assume k p , and hence k m , are 
positive. 

(A3) P(s) is minimum phase, i.e. all its zeros lie in the open left half complex 
plane. 

For the unmodelled part of the plant: 



(A4) The unmodelled dynamics P u {s — k\) is a strictly proper and stable 
transfer matrix such that |D| < a\, ||(.P„(,y - k\)s — D)(s + n 2 )|loo < «i> 
for some constants a\,ai > 0, where D — lim.^oo P u (s)s and 
11*00 II oo = su Pwe« 1*0)1 I 15 !- 

(A5) The output disturbance is differentiable and the upper bounds on 



\d 0 {t) |, 




exist. 



Remark 3.1 

• Minimum-phase assumption (A3) on the nominal plant P(s') is to guarantee 
the internal stability since the model reference control involves the cancella- 
tion of the plant zeros. However, as commented by [15], this assumption 
does not imply that the overall plant (3.1) possesses the minimum-phase 
property. 

• The latter part of assumption (A4) is simply to emphasize the fact that P„(s) 
are uncorrelated with p in any case [16]. The reasons for assumption (A5) 
will be clear in the proof of Theorem 3.1 and that of Theorem 4.1. 



3.2.2 MRAC based error model 

Since the plant parameters are assumed to be unknown, a basic strategy from 
the traditional MRAC [17], [19] is now used to construct the error model 
between y p {t) and y m (t). Instead of applying the traditional MRAC technique, 
a new adaptive variable structure control will be given here in order to pursue 
better robustness and tracking performance. Let (3.1) be rewritten as 



y P (t ) - d 0 (t) 



P(s) u p + pP u (s)[u p \ (t) = P(s)[u p + u](t) 



(3.4) 



then from the traditional model reference control strategy [19], it can be shown 
that there exists 0* = [0| , . . . , 9 2 „] T € R 2 " suc h that if 

a(s) 






\{s) 



d(s) 



D}(s) = [e* n ,e: +l ,...,e* 2n _ 2 ]^ + e* 2n _ l 



where a(s) = [1, s, . . . ,s n 2 ] T and A (s) is an /7th order monic Hurwitz 
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(3.5) 



polynomial, we have 

1 - D* b (s) - D}(s)P(s) = e* 2n M~\s)P{s) 

Applying both sides of (3.5) on u p + u, we have 

u p (t)+u(t)-D* b (s)[u p + u](t)-Df(s)[y p -d o ](t) = 0* 2n M~ l (s)[y p -d o ](t) (3.6) 

so that 

y P (t) - d 0 (t) = M(s)d* 2 ~ l u P + u - D* b (s)[u p + u] - D* f {s)[y p - d 0 ] (t) (3.7) 

Since 

D* h (s)[u p + u]{t) + D}(s)[y p - d 0 ](t) + 6* 2 n r m (t) 



= 0 * 



a(s ) 
\{s) 



T7 M"P+m 



= 0 * 



^0* T w(f)-0*'wrf o (;)+i^)[w]W 



W)^~ m 

y P {t) - d 0 {t) 

( 0 

a(s) 

W)' 






a(s ) , ... 
A(^ [j/;](?) 

y P (t) 

I'm ( i ) 





0 


-0* T 






d 0 {t) 




0 



Dl{s)[u){t) 



(3.8) 



we have 

y P {t) - d 0 {t) = M(s)0* 2 ~ l [ii P - 0* T w + &* T Wd„ + (1 - Z>j(5))[m] + 0* 2 n r m \(t) 

= M(s)0* 2 - l [u p - 0* T n’ + 0* T nv„ + pA (s)[u p ) + e* 2n r m ]{t) (3.9) 

where A(s) = (1 - D* b (s))P u (s) = ^1 - + ' ' — j^»C?)- lf we define 

the tracking error eo (?) as y p (t) — y m (t), then the error model due to the 
unknown parameters, unmodelled dynamics and output disturbances can be 
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readily found from (3.3) and (3.9) as follows: 

e 0 (t) = M(s)d* 2 ~ l u p - 0 * T w + 0* T w do + p,A(s)[u p ] (?) + d 0 {t) (3.10) 

In the following sections, the new adaptive variable structure scheme is 
proposed for plants with arbitrary relative degree. However, the control 
structure is much simpler for relative degree-one plant, and hence in Section 
3.3 we will first give a discussion for this class of plants. Based on the analysis 
for relative degree-one plants, the general case can then be presented in a more 
straightforward manner in Section 3.4. 



3.3 The case of relative degree one 



When P{s) is relative degree one, the reference model M(s) can be chosen to be 
strictly positive real (SPR) (Narendra and Annaswamy, 1988). The error model 
(3.10) can now be rewritten as 

e 0 (t) = M(s)Q* 2 ~ l u p - Q* T w + Q* T w do +6* 2n M~ l (s)[d 0 ]+ p.A(s)[u p \ (?) 



In the error model (3.11), the terms 0* T w, Q* T w do + 92 n M~ l {s)[d 0 ] and 
pA(s)[up] are the uncertainties due to the unknown plant parameters, output 
disturbance, and unmodelled dynamics, respectively. Let ( A m ,B mi C m ) be any 
minimal realization of M(s)9t n l which is SPR, then we can get the following 
state space representation of (3.11) as: 

e{t) = A m e(t)+ B m (u p (t)-Q* r w(?) + @* T Wd o (t)+0* 2n M~ l (j) [</„](?)+ p,A(s)[u p \(t)) 

e 0 (t) = C m e(t) (3.12) 

where the triplet (A,„, B m , C m ) satisfies 

P m A m + A^Pm = —2 Q m ] P m Bm = Cl (3.13) 

for some P m = Pi > 0 and 0,„ = Qj n > 0. 

The adaptive variable structure controller for relative degree-one plants is 
now summarized as follows: 

( 1 ) Define the regressor signal 

W’M= C ^Wp\(t), C ^[yp\(t),yp(t),rm(t) = [h’i(?),H’ 2 (?),...,H'2„(?)] T 



(3.14) 
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and construct the normalization signal m(t) [15] as the state of the 
following system: 

m(t) = —6om(t) + 8\{\u p (t)\ + 1), m(0) > ^ (3.15) 

°o 

where ho, hi > 0 and ho + S 2 < min (k \ , k 2 ) for some 8 2 > 0. The parameter 
k 2 > 0 is selected such that the roots of A(,s — k 2 ) lie in the open left half 
complex plane, which is always achievable. 

(2) Design the control signal «,,(?) as 

In 

Up(t ) = y>sgn (e 0 Wj)6j(t)wj(t)) - sgn (e o )A(0 - sgn (e 0 )f3 2 (t)m(t) (3-16) 
7=1 

f 1 if eo > 0 
sgn (e 0 ) = < 0 if e 0 = 0 
[ - 1 if <?o < 0 

(3) The adaptation law for the control parameters is given as 

0j{t) = lj\eQ(t)wj(i)\, 7=1,..., 2 n 
A(0 = g\\eo{t)\ 

fait) = g 2 \e 0 (t)\m(t) (3.17) 

where 7y,gi,^2 > 0 are the adaptation gains and 9j( 0),fli(0),fi 2 (0) > 0 (in 
general, as large as possible) 7 = 1, . . . ,2 n. 

The design concept of the adaptive variable structure controller (3.15) and 
(3.16) is simply to construct some feedback signals to compensate for the 
uncertainties because of the following reasons: 

• By assumption (A5), it can be easily found that |0* T itv o (t)+ 
9* 2n M(sy l [d 0 \(t)\ < P\ for some f3\ > 0. 

• With the construction of in, it can be shown [15] that pA(s)[u p ](t) < ffim(t), 
Vt > 0 and for some constant ff 2 > 0. 

Now, we are ready to state our results concerning the properties of global 
stability, robust property, and tracking performance of our new adaptive 
variable structure scheme with relative degree-one system. 

Theorem 3.1 (Global Stability, Robustness and Asymptotic Zero Tracking 
Performance) Consider the system (3.1) satisfying assumptions (A1)-(A5) with 
relative degree being one. If the control input is designed as in (3.15), (3.16) and 
the adaptation law is chosen as in (3.17), then there exists p* > 0 such that for 
p£ [0,//] all signals inside the closed loop system are bounded and the 
tracking error will converge to zero asymptotically. 
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Proof: Consider the Lyapunov function 



2« | 2 , 

V a = \e^P m e (ej-\0}\) 2 + ^2 — (/ 3j - (3*f 



2 9i 



where P m satishes (3.13). Then, the time derivative of V a along the trajectory 
(3.12) (3.17) will be 

V a = ~e r Q m e + e 0 (u p - 0* t h- + Q* T w do + 0* 2n M~ l (s)[d 0 \ + pA(s)[u p ]) 

2 « 1 2 . 

+ E-(^-iw + Et(^-W 

7=1 ^ 7=1 yj 

2 n 

< ~e T Q m e - ^2 \ e o w j\(fy ~ \ e *j\) ~ M(/?i - #) - ko|(/?2 - /^)w 
7=1 

2 « 1 2 , 

+Er(^-iW+ET(^-W 

7=1 iJ 7=1 »7 

, I |2 



for some constant q m > 0. This implies that e € L2 n Loo and 0jj = 
1, . . . ,2«,/3i,/?2,co G Lqo and, hence, all signals inside the closed loop system 
are bounded owing to Lemma A in the Appendix. On the other hand, it can be 
concluded that e G L x by (3.12). Hence, e G L2 fl L x and e G L 00 readily imply 
that e and eo will at least converge to zero asymptotically by Barbalat’s lemma 
[19], Q.E.D. 

In Theorem 3.1, suitable integral adaptation laws are given to compensate 
for the unavailable knowledge of the bounds on \6* and fi* . Theoretically, the 
adaptive variable structure controller will stabilize the closed loop system with 
guaranteed robustness and asymptotic zero tracking performance no matter 
what 0/(O)’s and /3,(0)’s are. However, according to the following Theorem 3.2, 
we will expect that positive and large values of 0/(0), /3 7 (0) should result in 
better transient response and tracking performance, especially when 
0,(0) > \0*\,Pj(0) > (3*. 

Theorem 3.2 (Finite-Time Zero Tracking Performance with High Gain 
Design) Consider the system set-up in Theorem 3.1. If 0/(0) > 
\9j\,/3j(0) > /3j, then the output tracking error will converge to zero in Unite 
time with all signals inside the closed loop system remaining bounded. 

Proof Consider the Lyapunov function Vb = \e T P m e where P,„ satisfies 
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(3.13). The time derivative of V/, along the trajectory (3.12) becomes 

2 n 

V h = ~e T Q m e - Y \eoWj\(Oj - \0*\) - |e 0 |(A - P\) ~ ko \(Pi - PV)m 

7=1 

< -e T Q,„e 

< -k^Vb 

for some A '3 > 0 since 9j(t) > \9J\, pj(t) > Pj ,\/t > 0. This implies that e 
approaches zero at least exponentially fast. Furthermore, by the fact that 

e 0 e 0 = e 0 {C m A m e + C,„B m (u p - 0* T w + Q* T w do + 6* 2n M~ l (s)[d 0 ] + pA(s)[u p ])} 

2 n 

< k 4 \e 0 \\e\ - Y ko Wj\(9j - \6*\) - \e 0 \(Pi - P\) - \e 0 \{Pi - P* 2 )m 

7=1 

/ 2 n 

< k 4 \e 0 \\e\ - ko| ( Y I w i\( 6 i ~ \°j D + (A “ P*i) + (fo ~ Pi) m 

V 7=l 

where k 4 = \C m A m \, and that \e\ approaches zero at least exponentially fast, 
there exists a finite time T\ > 0 such that eo^o < —k 5 |e 0 | for all t > T\ and for 
some k 5 > 0. This implies that the sliding surface e 0 = 0 is guaranteed to be 
reached in some finite time 73 > T\ > 0. Q.E.D. 

Remark 3.2: Although theoretically only asymptotic zero tracking perform- 
ance is achieved when the initial control parameters are arbitrarily chosen, it is 
encouraged to set the adaptation gains jj and g f in (3.17) as large as possible. 
This is because the large adaptation gains will provide high adaptation speed 
and, hence, increase the control parameters to a suitable level of magnitude so 
as to achieve a satisfactory performance as quickly as possible. These expected 
results can be observed in the simulation examples. 



3.4 The case of arbitrary relative degree 

When the relative degree of (3.1) is greater than one, the controller design 
becomes more complicated than that given in Section 3.3. The main difference 
between the controller design of a relative degree-one system and a system with 
relative degree greater than one can be described as follows. When (3.1) is 
relative degree-one, the reference model can be chosen to be strictly positive 
real (SPR) [19]. Moreover, the control structure and its subsequent analysis of 
global stability, robustness and tracking performance are much simpler. On the 
contrary, when the relative degree of (3.1) is greater than one, the reference 
model M (s') is no longer SPR so that the controller and the analysis technique 
in relative degree-one systems cannot be directly applied. In order to use the 
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similar techniques given in Section 3.3, the adaptive variable structure 
controller is now designed systematically as follows: 



(1 ) Choose an operator L\(s) = (\ (s) . . . f p -\ (s) = (s + cci) . . . (s + a p _i) such 

that M(s)L l (s) is SPR and denote Lj(s) = £ ( (s) . . . i = 

2 , • • • j p — 1 ,L p (s) = 1 . 

(2) Define augmented signal 



y«(t) = M(j)Z-i(j) 





(0 



and auxiliary errors 

e a \ (t) = e 0 (t)+y a (t) 
e a i{t) = ^y[“2](0 - 

e a3 (0 = ^yN(0 - 



1 



F{ts) 

1 

cm 



MM 

MM 



(3.18) 

(3.19) 

(3.20) 



*-afi (l) — 



■P - 1 



w 



MM - 



F(ts) 



M- 1](0 



(3.21) 



where 



1 2 

Uj) (?) is the average control of w,(?) with F(ts) = (ts + 1 ) , r 



MM 



being small enough. In fact, F(ts) can be any Hurwitz polynomial in ts 

1 



with degree at least two and F(0) = 1. In the literature, 



F(ts) 



is referred to 



as an averaging filler, which is obviously a low-pass filter whose bandwidth 
can be arbitrarily enlarged as r — > 0. In other words, if r is smaller and 

smaller, the filter — *— is flatter and flatter. 

F{rs) 

(3) Design the control signals u p , Uj, and the bounding function m as follows: 

2 n 

mi (t) = ^(-sgn (e a i tj)0j(t)£j(t)) - sgn (e fl i)AM - sgn (e al )/3 2 (t)m(t) 

7=1 



MO = -sgn ( e ai ) 

tip (t) — Mp(t) 



(t- tW 



F(ts) 



k-i](0 



+ V , i = 2, • • • , p 



(3.22) 

(3.23) 

(3.24) 
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with jy > 0 and 

- 55) mw 

The bounding function m(t) is designed as the state of the system 

m(t) = - 6om(t ) + 8 \{\u p {t)\ + 1), m( 0) > ^ (3.25) 

£>o 

with S 0 , Si > 0 and <5 0 + 62 < . . . ,a p - 1) for some 6 2 > 0. 

(4) Finally, the adaptation law for the control parameters 6 j,j = 1, . . . , 2 n and 
/?i,/?2 are given as follows: 

0j(t) = , j=\....,2n (3.26) 

/3i(0 = 9i\e a \(t)\ (3.27) 

kit) = g 2 \e al (t)\m(t) (3.28) 



with 9j( 0) > 0, /3y(0) > 0 and 7 / > 0, g, > 0. 

In the following discussions, the construction of feedback signals £(t), m(t) and 
the controller (3.22) (3.23) will be clear. 

In order to analyse the proposed adaptive variable structure controller, we 
first rewrite the error model (3.10) as follows: 

eo(t) = M{s)[u p - e* 2 ~ l Q* T w + 9* 2 - { Q* T w do + pA(s)[u p ] 

+ (^2 n * _ 1 ) u p]( t ) + d 0 {t) 

= M{s)Li{s) [u p ] 02 , 7 * [Q* T w do +6* 2n M-\s) [d 0 \] 

+ + (1 - ^n>p] (■ t ) (3.29) 

Now, according to the design of the above auxiliary error (3.18) and error 
model (3.29), we can readily find that e a \ always satisfies 

e a \{t) = M{s)L\{s) u\ - e* 2 - x Q* T ^ + ^^[Q* T w d0 + 0* 2n M~\s)[d o ]] 

+ [mA(j)[m p ] + (1 - 0 2 n )u p ] (t) (3.30) 

It is noted that the auxiliary error e a \ is now explicitly expressed as the output 
term of a linear system with SPR transfer function M{s)L\ (s) driven by some 
uncertain signals due to unknown parameters, output disturbances, un- 
modelled dynamics and unknown high frequency gain sign. 
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Remark 4.1 The construction of the adaptive variable structure controller 
(3.22) is now clear since the following facts hold: 



• Since 



9 



.-i 

2 n 



L\(s) L 
we have 



Q* T Wdo + 9* ln M 1 (s') [d 0 \ (?) is uniformly bounded due to (A5), 



o*-i r 

? 2n 



L\{s) L 



e*'w d0 + e* 2n M-\s)[d 0 ]\(t) 



<ft 



(3.31) 



for some (3\. 

• With the design of the bounding function m(f) (3.25), it can be shown that 



iK 1 r 

U {s) L ; 



p,A(s)[u p \ + (l - 9* ln )u p (?) 



< /?2 m{t) 



(3.32) 



for some ff 2 > 0. 

The results described in Remark 4.1 show that the similar techniques for the 
controller design of a relative degree-one system can now be used for auxiliary 
error e a \. But what happens to the other auxiliary errors e a 2 , . . . , e ap , especially 
the real output error e<, as concerned? In Theorem 4.1, we summarize the main 
results of the systematically designed adaptive variable structure controller for 
plants with relative degree greater than one. 

Theorem 4.1 (Global Stability, Robustness and Asymptotic Tracking 
Performance) Consider the nonlinear time-varying system (3.1) with relative 
degree p > 1 satisfying (A1)-(A5). If the controller is designed as in (3.18)- 
(3.25) and parameter update laws are chosen as in (3.26)-(3.28), then there 
exists t* > 0 and p* > 0 such that for all r € (0,r*) and p e (0 ,p*), the 
following facts hold: 

(i) all signals inside the closed-loop system remain uniformly bounded; 

(ii) the auxiliary error e a \ converges to zero asymptotically; 

(iii) the auxiliary errors e a j, i = 2, . . . , p, converge to zero at some finite time; 

(iv) the output tracking error eo will converge to a residual set asymptotically 
whose size is a class K function of the design parameter r. 



Proof The proof consists of three parts. 

Part I Prove the boundedness of e a j and 6 1 , . . . , $ 2 «, A , /32- 

Step 1 First, consider the auxiliary error e a \ which satisfies (3.30). Since 




Adaptive Control Systems 53 



M(s)Li(s) is SPR, we have the following realization of (3.20) 

ei =A l e 1 +B l (ip - (^ 1 0* T ^ + [& T w do + 0^M~ l [s)[d„]] 

+ [MA(s)[«p] + (1 - 9* 2n )u p }^j 

e a \ = C\e\ (3.33) 

with P\A\ +A\P\ = — 2Q\, P\B\ = Cj for some P\ = P\ >0 and Q\ = 
Qj > 0. Given a Lyapunov function as follows: 

2 « I / 9* \ 2 2 1 

" = + W.) <3M) 

it can be shown by using (3.32) and (3.31) that 

Vi = -ejQ iei +e al (ux-ei- l Q* T i + ^^[Q* T w do + e* 2n M-\s)[d 0 ]} 



+ ^ s ) + (1 ” ^2 n) U p]j 

2n i / Q* \ 2 I 

+Ez («,- ^ H+E^-W 

/=! 7y V p 2« / /= i Uj 



In / Q* \ 

<-e{Qiei-J2\eai§\[0j- Jr ) ~ \e a i\{0i ~ ft) ~ \e a i \(02 ~ 0* 2 )m 

j= 1 V P 2h / 

2 « 1 / 9* \ . 2 I 

+ ~ 0T H + 

y=i 7/ V ( h>„ / /= i 0/ 

= -e}giei 

< -q\\e\\ 2 

for some q\ > 0 if the controller in (3.22) and update laws in (3.26) (3.28) are 
given. This implies that 02, i, Pi, 02 G Loo and e a \ G L 2 D L x . 

Step 2 From (3. 19) (3.2 1 ), we can find that e a 2 , . . . ,e ap satisfy 

A(j) , i 

e a 2 — —Ol \e a 2 + U-2 — r Ml 

F{ts) 



, ^ p~ 1 ($) r l 

&ap — &p—\£ap H - Up ~F~(^ 1 




54 Adaptive variable structure control 



Now by the following facts that for i = 2,...,p: 






— &ai I C%i—\£ai H” Mi r-,/ \ [M-i— \ 



F( TS ) 



= -Ui-ie 2 ai + e ai { - sgn{e ai ) 



F(ts) 



[Ui- 1 



ti- t(j) 
F(rs) 



[ut - 1 



or 



dt 



-oti-i \e ai \ - T] 



(3.35) 



when |e ai | 7 b 0. This implies that e a j will converge to zero after some finite time 
T > 0. 



Part II Prove the boundedness of all signals inside the closed loop system. 

Define e ai = M{s)L i _ l {s)[e a j\, i = 2, . . , , p and E a = e aX + e a2 4 b e ap 

which is uniformly bounded due to the boundedness of Then, from 
(3. 1 8) (3.2 1 ), we can derive that 



E a 



e 0 + M(s)Li(s) 


h z, <^) 




' 1 1 


+ M(s)Li(s) 




+ M(s)L 2 (s) 


L(,) w F(„) W ] 



+ M(s)L p _\(s) 



= e 0 + R 



Ms) [Up] F(rs) [Up - l] 
1 



U ' + W) [U2] 



1 

Si(s) ■ .Jp-l{s) 



Wp- 1 ] 



(3.36) 



Now, since IKm,-),^ < & 6 ||( e o),|loc + h, i = 1, . . . ,p - 1 for some k 6 > 0 by 
Lemma A in the appendix, it can be easily found that 



1 



ll\ 



A (s) 



M 



1 



A {s)..Jp- 2 {s) 



[ w p-l] 



< ^7||(e 0 )J| oo + kj 
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for some k-/ > 0. Furthermore, since the H ^ norm of || ' ( 1 — lloo = 2r an< ^ 
||,sM(,s)Li(,y)|| 00 = kg for some kg > 0, we can conclude that 

|| sm(j)li(j) || (^nwjioo +ki) 

< t(/:9||(c 0 ),IIoo + A '9) 

for some k 9 > 0. Now from (3.36) we have 

IIMJL < IKML + IIWJloo < IK^JL + r^nwjLH-^) 

which implies that there exists a r* > 0 such that 1 — r*k 9 > 0 and for all 

IIMJL < ll( ^^°° + r/r9 (3.37) 

Combining Lemma A and (3.37), we readily conclude that all signals inside the 
closed loop system remain uniformly bounded. 

Part III: Investigate the tracking performance of e a \ and e 9 . 

Since all signals inside the closed loop system are uniformly bounded, we 
have 

Gl £ L 2 n Loo, t-Q 1 C Lqo 

Flence, by Barbalat’s lemma, e a \ approaches zero asymptotically and 
E a = e„i + e a 2 + ■ ■ ■ + e ap also approaches zero asymptotically. Now, from 
the fact of (3.37) and E a approaching zero, it is clear that eo will converge to 
a small residual set whose size depends on the design parameter r. Q.E.D. 
As discussed in Theorem 3.2, if the initial choices of control parameters 

9 * 

6j(0),Pj(0) satisfy the high gain conditions 0/(0) > -L- and /3j( 0) > (3*, then, 

“in 

by using the same argument given in the proof of Theorem 3.2, we can 
guarantee the exponential convergent behaviour and finite-time tracking 
performance of all the auxiliary errors e ai . Since e m reaches zero in some 

finite time and E a = e a \ + e a 2 + 1- e ap , it can be concluded that E a converges 

to zero exponentially and e 9 converges to a small residual set whose size 
depends on the design parameter r. We now summarize the results in the 
following Theorem 4.2. 

Theorem 4.2: (Exponential Tracking Performance with High Gain Design) 

Consider the system set-up in Theorem 4.1. If the initial value of control 

9 * 

parameters satisfy the high gain conditions 0/(0) > — and (3j( 0) > fl*, then 

“l n 

there exists a r* and p* such that for all rG (0,r*] and p G (0,^*], the 
following facts hold: 

(i) all signals inside the closed loop system remain bounded; 
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(ii) the auxiliary errors e ai , i — 1 ,p, converge to zero in finite time; 

(iii) the output tracking errors eo will converge to a residual set exponentially 
whose size depends on the design parameter r. 



Remark 4.2: It is well known that the chattering behaviour will be observed in 

the input channel due to variable structure control, which causes the 
implementation problem in practical design. A remedy to the undesirable 
phenomenon is to introduce the boundary layer concept. Take the case of 
relative degree one, for example, the practical redesign of the proposed 
adaptive variable structure controller by using boundary layer design is now 
stated as follows: 



Up 




(eoWj)Oj{t)wj(t) 



ttOo) 



sgn(>o) if|e 0 |>£ 

— if kol < £ 
£ 



Tr(e 0 )Pi{t) - n(eo)P 2 (t)m(t) 



(3.38) 



for some small e > 0. Note that 7r(co) is now a continuous function. However, 
one can expect that the boundary layer design will result in bounded tracking 
error, i.e. eo cannot be guaranteed to converge to zero. This causes the 
parameter drift in parameter adaptation law. Hence, a leakage term is added 
into the adaptation law as follows: 

0j{t) = lj\ea{t)Wj{t)\ - aOj(t), j=l,...,2n 

A(0 = 0iMOI - cr/?i(0 

Pi(t) = g 2 \e 0 (t)\m(t) - a(3 2 (t) (3.39) 



for some a > 0. 



3.5 Computer simulations 

The adaptive variable structure scheme is now applied to the following 
unstable plant with unmodelled dynamics and output disturbances: 

»<') = ji + Z +,-2 (‘ + °' 01 ?Tto) M (') + 0 - 05si "( 5 ') 

Since the nominal plant is relative degree three, we choose the following steps 
to design the adaptive variable structure controller: 
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• reference model and reference input: 



M(s) 

Fm (t) 



8 

0 + 2) 3 

f 2 if t < 5 
1-2 if 5 < t < 10 



• design parameters: 

L\ (j) = ^i(sy 2 {s),^i(s) = s+ l,f 2 (s) =5 + 2 
A(5) = (5+l) 2 

F(rs) = (4 s + ') 



• augmented signal and auxiliary errors: 

y a (t) = M(s)Li(s) m 0) 

e a \ (0 = e 0 (t) + v a (t) 

t * 3(,) = ?7t) [ "-' l( ' ) -FR [ " !|( ' ) 

• controller: 

«i(0 = 5^^-sgn (e al £j)0j(t)£j(t)'J ~ sgn 0„i)AO) - sgn (e al )/3 2 (t)m(t) 
Ui(t ) = -sgn (e ai ) ^ [».-i]0) +1^, i= 2,3 

ttp (j) Up(t) 

m(t ) = — m(t ) + 0.005(|w /; (/)| + 1), m( 0) = 0.2 

• adaptation law: 

0/0) = 7y|e a i(00(0l- J= !,•••, 6 

AO) = 0i|e fl i(/)l 

/? 2(0 = 6f 2 |e a i(/)|m(/) 



Three simulation cases are studied extensively in this example in order to verify 
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Figure 3.1 y p {-),y m { — ), time (sec) 




all the theoretical results and corresponding claims. All the cases will assume 
that there are initial output error j p (0) — _y m (0) = 4. 

( 1 ) In the first case, we arbitrarily choose the initial control parameters as 

0/(0) =0.1, j = 1 — ,6 
$(0) =0.1, j = 1,2 

and set all the adaptation gains -y y = g- = 0.1. As shown in Figure 3.1 (the 
time trajectories of y p and y m ), the global stability, robustness, and 
asymptotic tracking performance are achieved. 

(2) In the second case, we want to demonstrate the effectiveness of a proper 
choice of 0j( 0) and /3j( 0) and repeat the previous simulation case by 
increasing the values of the controller parameters to be 

0/(0) = 1, 7=1,- -.,6 
A(0)=1, 7=1,2 

The better transient and tracking performance between y p and y m can now 
be observed in Figure 3.2. 
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Figure 3.3 y p ( — ), time (sec) 



(3) As commented in Remark 3.2, if there is no easy way to estimate the 
suitable initial control parameters (9/(0) and /3/(0) like those in the second 
simulation case, it is suggested to use large adaptation gains in order to 
increase the adaptation rate of control parameters such that the nice 
transient and tracking performance as described in case 2 can be retained 
to some extent. Hence, in this case, we use the initial control parameters 
as in case 1 but set all the adaptation gains to jj = gj = 1. The 
expected results are now shown in Figure 3.3, where rapid increase of 
control parameters do lead to satisfactory transient and tracking 
performance. 



3.6 Conclusion 

In this chapter, a new adaptive variable structure scheme is proposed for model 
reference adaptive control problems for plants with unmodelled dynamic and 
output disturbance. The main contribution of the chapter is the complete 
version of adaptive variable structure design for solving the robustness and 
performance of the traditional MRAC problem with arbitrary relative degree. 
A detailed analysis of the closed-loop stability and tracking performance is 
given. It is shown that without any persistent excitation the output tracking 
error can be driven to zero for relative degree-one plants and driven to a small 
residual set asymptotically for plants with any higher relative degree. 
Furthermore, under suitable choice of initial conditions on control parameters, 
the tracking performance can be improved, which are hardly achievable by the 
traditional MRAC schemes, especially for plants with uncertainties. 
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Appendix 



Lemma A Consider the controller design in Theorem 3.1 or 4.1. If the control 
parameters 9j(t),j = 1, . . . ,2n,(3i(t) and are uniformly bounded Vi, then 
there exists p* > 0 such that u p (t) satisfies 

IIMrIL < + k (A.l) 

with some positive constant k > 0. 

Proof Consider the plant (3.1) which is rewritten as follows: 

y{t) - d 0 (t) = P(s)( 1 + pP u (s))[u p \(t) (A.l) 

Let f(s) be the Hurwitz polynomial with degree n — p such that f(s)P(s) is 
proper, and hence, f~ l (s)P~ l (s) is proper stable since P(s) is minimum phase 
by assumption (A3). Then 

y(t) - d 0 (t) = P(s)f(s)f~ l (s)( 1 + pP u (s))[u p \(t) (A. 3) 

which implies that 

f-\s)p-\s)[y-d 0 \(t) - pf-\s)P u (s)[u p ](t) =f-\s)[u p ](t)±u*(t) (A.4) 

Since /“'(s),? -1 ^) and f~ l (s)P u (s) are proper or strictly proper stable, we can 
find by small gain theorem [7] that there exists p* > 0 such that 

HO*), Hoc < «IIOv)/lloo + K < K IIOo),lloo + K ( A - 5 ) 

for some suitably defined k > 0 and for all p G [0, p*]. Now if we can show that 

IIOOJIoo < k|IO*),IIoo + k ( a - 6 ) 

for some k > 0, then (A.l) is achieved. By using Lemma 2.8 in [19], the key 
point to show the boundedness between u p and u* in (A. 6) is the growing 
behaviour of signal u p . The above statement can be stated more precisely as 
follows: if u p satisfies the following requirement 

\u p (ti)\ > c\u p (t\ + T)\ (A.7) 



where t\ and t\ + T are the time instants defined as 

[ft) t\ + T] C = {t | \u p \ = || (M;,),|| 00 } 



(A-8) 



and c is a constant e (0, 1), then u p will be bounded by u * , i.e. (A. 6) is achieved. 
Now in order to establish (A.7) and (A. 8), let ( A p ,B p ,C p ) and (A, B) be the 

state space realizations of P(y)(l + pP„(s)) and respectively. Also 

\(s) 
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define S = [xj, wj , wJ,m] T . Then, using the augmented system 



~ x p ~ 




r ^ 


0 


0 


W 1 




0 


A 


0 


W2 




BCp 


0 


A 


m 




0 


0 


0 





V 




BpUp 




W i 

W>2 


+ 


Bu p 

Bd 0 




m 




+ 

It 



Since d 0 is uniformly bounded, we can easily show according to the control 
design (3.16) or (3.24) that there exists n such that 

|5|<k||(S) ( ||oo + « 



This means that S' is regular [21] so that x p ,wi,W 2 ,m,y p and u p will grow at 
most exponentially fast (if unbounded), which in turn guarantees (A. 7) and 
(A. 8) by Lemma 2.8 in [19]. This completes our proof. Q.E.D. 
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Indirect adaptive periodic 
control 



D. Dimogianopoulos, R. Lozano and A. Ailon 



Abstract 

In this chapter a new indirect adaptive control method is presented. This 
method is based on a lifted representation of the plant which can be stabilized 
using a simple perfonnant periodic control scheme. The controller parameter’s 
computation involves the inverse of the controllability/observability matrix. 
Potential singularities of this matrix are avoided by means of an appropriate 
estimates modification. This estimates transformation is linked to the covar- 
iance matrix properties and hence it preserves the convergence properties of the 
original estimates. This modification involves the singular value decomposition 
of the controllability/observability matrix’s estimate. As compared to previous 
studies in the subject the controller proposed here does not require the frequent 
introduction of periodic n-length sequences of zero inputs. Therefore the new 
controller is such that the system is almost always operating in closed loop 
which should lead to better performance characteristics. 



4.1 Introduction 

The problem of adaptive control of possibly nonminimum phase systems has 
received several solutions over the past decade. These solutions can be divided 
into several different categories depending on the a priori knowledge on the 
plant, and on whether persistent excitation can be added into the system or not. 

Schemes based on persistent excitation were proposed in [1], [11] among 
others. This approach has been thoroughly studied and is based on the fact 
that convergence of the estimates to the true plant parameter values is 
guaranteed when the plant input and output are rich enough. Stability of the 
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closed loop system follows from the unbiased convergence of the estimates. 
The external persistent excitation signal is then required to be always present, 
therefore the plant output cannot exactly converge to its desired value because 
of the external dither signal. This difficulty has been removed in [2] using a self- 
excitation technique. In this approach excitation is introduced periodically 
during some pre-specified intervals as long as the plant state and/or output 
have not reached their desired values. Once the control objectives are 
accomplished the excitation is automatically removed. Stability of these type 
of schemes is guaranteed in spite of the fact that convergence of the parameter 
estimates to their true values is not assured. This technique has also been 
extended to the case of systems for which only an upper bound on the plant 
order is known in [12] for the discrete-time case and in [13] for the continuous- 
time case. 

Since adding extra perturbations into the system is not always feasible or 
desirable, other adaptive techniques not resorting to persistent exciting signals 
have been developed. Different strategies have been proposed depending on the 
available information on the system. 

When the parameters are known to belong to given intervals or convex sets 
inside the controllable regions in the parameter space, the schemes in [9] or [10] 
can be used, respectively. These controllers require a priori knowledge of such 
controllable regions. An alternative method proposes the use of a pre-specified 
number of different controllers together with a switching strategy to commute 
among them (see [8]). This method offers an interesting solution for the cases 
when the number of possible controllers in the set is finite and available. In the 
general case the required number of controllers may be large so as to guarantee 
that the set contains a stabilizing controller. 

In general, very little is known about the structure of the admissible regions 
in the parameter space. This explains the difficulties encountered in the search 
of adaptive controllers not relying on exciting signals and using the order of the 
plant as the only a priori information. In this line of research a different 
approach to avoid singularities in adaptive control has been proposed in [6] 
which only requires the order of the plant as available information. The 
method consists of an appropriate modification to the parameter estimates 
so that, while retaining all their convergence properties, they are brought to the 
admissible area. The extension of this scheme to the stochastic case has been 
carried out in [7]. The extension of this technique to the minimum phase 
multivariable case can be found in [4], This method does not require any a 
priori knowledge on the structure of the controllable region. It can also be 
viewed as the solution of a least-squares parameter estimation problem subject 
to the constraint that the estimates belong to the admissible area. The 
admissible area is defined here as those points in the parameter space whose 
corresponding Sylvester resultant matrix is nonsingular. The main drawback of 
the scheme presented in [6] is that the number of directions to be explored in 
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the search for an appropriate modification becomes very large as the order of 
the system increases. This is due essentially to the fact that the determinant of 
the Sylvester resultant matrix is a very complex function of the parameters. 

The method based on parameter modification has also been previously used 
in [3] for a particular lifting plant representation. The plant description 
proposed in [3] has more parameters than the original plant, but has the 
very appealing feature of explicitly depending on the system’s controllability 
matrix. Indeed, one of the matrix coefficients in the new parametrization turns 
out to be precisely the controllability matrix times the observability matrix. 
Therefore, the estimates modification can actually be computed straightfor- 
wardly without having to explore a large number of possible directions as is the 
case in [6]. It actually requires one on-line computation involving a polar 
decomposition of the estimate of the controllability matrix. However, no 
indication was given in [3] on how this computation can be effectively carried 
out. Recently, [14] presented an interesting direct adaptive control scheme for 
the same class of liftings proposed in [3]. As pointed out in [14] the polar 
decomposition can be written in terms of a singular value decomposition (SVD) 
which is more widely known. Methods to perform SVD are readily available. 
This puts the adaptive periodic controllers in [3] and [14] into a much better 
perspective. At this point it should be highlighted that even if persistent 
excitation is allowed into the system, the presented adaptive control schemes 
offer a better performance during the transient period by avoiding singularities. 

The adaptive controller proposed in [3] and [14] is based on a periodic 
controller. A dead-beat controller is used in one half of the cycle and the input 
is identically zero during the other half of the cycle. Therefore a weakness of 
this type of controllers is that the system is left in open loop half of the time. In 
this chapter we propose a solution to this problem. 

As compared to [3] and [14] the controller proposed here does not require the 
frequent introduction of periodic «-length sequences of zero inputs. The new 
control strategy is a periodic controller calculated every «-steps, n being the 
order of the system. For technical reasons we still have to introduce a periodic 
sequence of zero inputs but the periodicity can be arbitrarily large. As a result 
the new controller is such that the system is almost always operating in closed 
loop which should lead to better performance characteristics. 



4.2 Problem formulation 

Consider a discrete-time system, described by the following state-space 
representation: 

x t+ \ = Ax, + bu, + b'v' t 
y, = c T x, + v" 



(4.1) 
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where x is the (n x 1) state vector, it, y are respectively the plant’s input and 
output and A, b , c T are matrices of appropriate dimensions. Signals v' v" can be 
identihed as perturbations belonging to a class, which will be discussed later. 
The state part of equation (4.1) is iterated n times: 

x, +n = A n x t + [A"- l b...b\U,+ [A n ~ l b'...b'] V\ (4.2) 

Matrix [A n ~ l b . . .b\ is the system’s (4.1) controllability matrix C(n x n) and 
similarly C = \A n ~ l b ' . . . b 1 ] . Vectors U,, V\ are defined as: 

U,= [u(t)...u(t + n~ 1)] UGTZ" V', = [v'(t)...v'(t + n- 1)] V' G lZ n 

Working in a similar way as in (4.2), an alternative expression for x, can be 
obtained: 

x, = A"xt-„ + CU t -n + C V t _ n (4.3) 

introducing (4.3) into (4.2) becomes: 

x t+n — ^4 2n x t -„ + A"CU t - n + A"C'V' tn + CU t + C V' t (4-4) 

The next step will be to express the system’s output y for a time interval 
[t, t + n— 1], using (4.1): 

y, = c T x, + v" 

y t+ 1 = c T Ax, + c T bu, + c T b'v' t + v" +l 



y ,+ n - 1 = c T A" l x, + c T A" 2 bu, + . . . + c T bu t+n - 2 + c T A" 2 b'v' t + . . . 

+ c T b'v' t+n _ 2 + v'l +n _ l 

which can be written as the following expression: 

Y t+n = Ox, + GU, + a V\ + IV” (4.5) 

where 

0=[c T c T A...c T A n ~ l ] T , O € U nxn (4.6) 

is the system’s observability matrix 





' 0 


O' 






G = 


c T b 


0 


g g n nxn y, = 


yt—n 




_c T A"- 2 b ■■ 


• c T b 0_ 




yt- 1 _ 



and V" is similarly defined as V' but with x! replaced by v" . Identically, G' is 
defined as G in (4.7) with b’ instead of b. From (4.5) we get: 

Y t+ln = Ox,+„ + GU,+„ + G’V t+n + IV” +n 



(4.8) 
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Since the state is not supposed to be measurable, we will obtain an expression 
that depends only on the system’s output and input. Introducing (4.4) into the 
above it follows 



Y ,+ 2 „ = OA 2n x t - n + OA n CU,- H + OA”C'V' t _ n + OCU, + OC'V[ 
+ GU t+n + G'V' t+n + IVl n 



(4.9) 



whereas, by (4.5) 

Y, = Ox t _„ + GU t - n + G' V\_ n + IV'!_ n (4.10) 

and 

x,-„ = 0- l [Y t -G U,-„ - G! V',_ n - IV" t _ n ] (4.11) 

So finally, by substituting (4.11) into (4.9), one has: 

Y, +2 n =DY, + BU, + Bt U t -„ + GU l+n + N t+2 „ (4. 12) 



where t = 0, n, 2 n, . . . and 

D = OA 2n O~ l , B = OC, B' =[OA n C-OA 2n O~ l G}, D,B,B € TZ' ,xn 

(4.13) 



N = 0A n C'V' t _ n -0- l A 2n 0- l G'V' t _ n +0C'V' t +G'V' t+n +IV'; +n -0A 2n 0- l V';_ 



The following control law is proposed for the plant: 

BU, = —DY, — 

The closed loop system becomes: 

Y 1+2 1 1 GUf+n = N t+ 2 n 

whereas from (4.11) and (4.16), one has: 

x, +n = 0 - 1 [N t + 2 n ~ G' V' t+n + IV'; +n \=0- x [N t+2n - Nj +2n ] ( 4 . 17 ) 



(4.14) 

(4.15) 

(4.16) 



where N] +ln is another noise term. Therefore the state x t+n , t = 0, n, 2n , ... is 
bounded by the (bounded) noise. In the ideal case x, +n will be identically zero. 
From (4.3), it is clear that U, is bounded and becomes zero in the ideal case. 



Remark 2.1 Dead-beat control can induce large control inputs. For a 
smoother performance, one can use the following control strategy: 

BU, = —DY, — B U + C[Y, - GU,. n ] (4.18) 

where C has all its eigenvalues inside the unit disc. The closed loop system is in 
this case: 

Y t+2n ~ GU t+n = C[Y, - GU,- n ] + N t+ 2 n (4.19) 

In the ideal case the LHS of the above converges to zero and so does the state. 
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4.3 Adaptive control scheme 



In this section the adaptive control scheme based on the strategy previously 
developed is presented. For sake of simplicity an index k will be associated to t 

as follows: t = kn, k= 0,1,2, The plant representation (4.12) can be 

rewritten as: 

Y k+ 1 = H k + N k+l Y k+ 1 G 1l n (4.20) 

where: 

6= B\Bf\G\D 6 G H nx4n , 4 = [u k . i U k —2 U k Y k - 1 ] T 0 G K 4 " 

(4.21) 

for k = 0, 1, 2, The class of disturbances satisfies: 

IW*)II<»7 (4-22) 

The following a priori knowledge on the plant is required: 

Assumption 1: n and an upper bound q in (4.22) is known 

Assumption 2: The existence of a lower bound bo is required, so that 

bo I < B 1 B. The value of this bound, however, may not be known. 

The equations of the adaptive scheme are given in the order they appear in 
the control algorithm. Note that signals Y k . cp k _ , are measurable. The predic- 



tion error: 

E k = Y k - 0 k -ifa_ i E k G n n (4.23) 

where the variables marked with a bar are the normalized versions of the 
original ones: 

Yk = Y k /[ 1 + ||^_i||], 4-1 = 4-i /[l + 114-1 II] (4.24) 

Least squares with dead zone: 

wi = ElE k + ^ T k _ x Pl_^ k _ x (4.25) 

k = ri/[\ + ||4-i ||] (4.26) 
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Pk 



Pk - 1 



\Pk- \ ( t ) k-\ 4>k- 1 Pk~ 1 p g j^4nx4n 

1 + 



(4.28) 



9 k = 9 k -i + \ k E k %_ l P k =[B k :& k :G k :D k \ 9 k cTZ" x4n (4.29) 



The forgetting factor A can also be chosen to commute between 0 and 1 using 
the same scheduling variable as in [6]. 



Modification of the parameters estimates 

L l (n x An) 



Pk = LkL k > 0 , L 

B k = QkSk ; S k > 0: 



L 2 (3n x An) 



QkQi = I q,S€K” 



(4.30) 

(4.31) 



e, = e k + Q k L\ Ll = \B k \ H k i G k i D k ] (4.32) 

0k 

(4.30) is the so-called Cholesky decomposition which gives a triangular positive 
semi-definite matrix L and (4.31) is a polar decomposition. The first one is often 
found in software packages used for control purposes, as it is a special case of 
the LU decomposition. The second one can be calculated in many ways. The 
way that is described here consists of calculating the singular value decomposi- 
tion of B: B k = U k S k V k T . 

Then the matrices Q k and S k can be calculated as follows: 

Ok = U k V T k , S k = V k S k V T k (4.33) 



4.4 Adaptive control law 

4.4 . 7 Properties of the identification scheme 

In this section we will present the convergence properties of the parameter 
identification algorithm proposed in the previous section. These properties will 
be essential in the stability of the adaptive control closed loop system. 

Let us define the parametric distance at instant k as: 6 k = 6 — 6 k and matrix 
H k as: 

H k = H k _ i x ^ kEk _ H k eU 4nxAn (4.34) 

1 + h^k-lPk-l^k-l 

The following properties hold: 
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(1) The forgetting factor A in (4.27) satisfies: 

0 5: Afc < a:, — a (4-35) 

(2) There is a positive definite function V k defined as: Vk = tr (P k + H k ) that 
satisfies: Vk < Vk - 1 

(3) The augmented error Wk in (4.25) is bounded as follows: 

lim sup (w\ — 6 k ( 1 + a)n) < 0 (4.36) 

k — >oo 

(4) The covariance matrix converges. 

(5) The forgetting factor A is such that: YaLo ^1 < 00 . 

(6) The modified parameter estimate B k satisfies the following: 

-r bnl 

(4-37) 

(7) Ok converges and: 0 k 0 k < Vgl 

(8) The modified prediction error Ek defined as: 

E k = Y k - e k -i$k - 1 4e n" (4.38) 

is bounded as follows: 

E T k E k < 2 wl (4.39) 

Proofs of the above properties are given in [3]. 



4.4.2 Adaptive control strategy 

The expression (4.38) can be written as follows (see also (4.24)): 

p _Yk - ©A— 1 4*k— 1 ± Qk-l&k-l 

k ~ 1 + 11^-1 II 

_ Yk - ©A--20A-1 + (©At- 2 ~ ©At- 1 ) 1 

1 + ||^-l|| 



From (4.21) and (4.32): 

Yk - B k _ 2 Uk ~ 2 — 0 k - 2 Uk-3 - G k - 2 U k -\ — D k _ 2 Y k-i 



E k = 



l + \\Uk- 2 Uk-3Uk-iYk- 2 \ 

(©fc- 2 — G>k-l)<t>k-' 

1 + Il0fc-lll 



1 



(4.40) 
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Using (4.32), we obtain: 



(Qa- 2 ~ Qfr-l )</>&--! 
1 + || 4 - 1 1 



(©£- 2 — ©*-l)4-l 

(Ok- 2 ~ 0k-l)4>k- 1 + (Pk-lL 7 k-2 ~ Pk-\L T k-\)4>k-\ 

(4.41) 



Note that property (7) tells us that: \\0k~i — Ok- 1 || — ► 0 as k —> oo. 

Note also that the last term in (4.41) can be rewritten as: 

(Pk-iL 7 k-2 - Pk-\L T k-i ± Pk-2L T k-\)4>k-\ 

= Pk-2(L T k-2 ~ L T k-\)4>k-\ + (4-2 ~ Pk-\)L J k-\4>k-\ (4.42) 

But \\L T k -2 — L T k-i\\ — * 0 because matrix L/ c results from the Cholesky 
decomposition of the covariance matrix P in (4.30), which converges. On the 
other hand, using (4.25), term L T k~\4>k-\ in (4.42) can be written as: 

(L T k-l4)k-\) T (L T k-\4>k-\) = $k-\Pk-\4>k-l 

< u T k -A\ Il4-i4-i|| 



< sJ(j)l_ x P T k-\Pk-\(t>k-\ (using (4.24)) 

< \J E T kEk + 4>l_ x P T k-\Pk-\4>k-\ 

= Vwh (4.43) 

Using (4.41), (4.42), (4.43) and the facts that as k — » oo, (Ok - 2 — Ok- 1 ) — > 0 
and \\(p k _ 2 — Pk—i ) II < 00 , we can conclude that: 



||(e*- 2 -e*- 1 )&-i|| < \\Pk-2-0k-i\\yfiw\ + \M (4.44) 

where 

6t = (Ok - 2 — Ok-i)<j>k-i + Pk-2(L T k-2 — L T k-i)<t>k-\ (4-45) 



and II^H — > 0 as k — > 00 . 

From (4.40) and using (4.39) and the above we have: 

II Y k - Bk- 2 U k - 2 - BU- 2 Uk-2 ~ Gk_ 2 Uk-i - Dk- 2 Yk-2\\ 

1 + || Ufc_ 2 U k _ 3 U k _ 1 Y k _ 2 || 

< 11411 + 11(0^2-0^-1)^-1 II 

< V2\wk\ + 1 1 4—2 — (3k-i\\VWk\ + 11411 (4.46) 

or equivalently 

II Y k+l - B k , U k - 1 - 4/,-! Uk-2 ~ G k -\Uk - 1 Y k -i || 

l + Ht^jt^-jC^^-ill 



< V2K +1 | + 114-1 - 4llv / K+iT + nail (4.47) 
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Using the certainty equivalence principle the adaptive control strategy could 
simply be obtained from (4.15) by replacing the true parameters by their 
estimates. However, for technical reasons that will appear clear later, we will 
have to introduce some sequences of inputs equal to zero. These sequences are 
introduced periodically after a time interval of arbitrary length L. We will 
assume that k > k* where k* is a time instant large enough such that the RHS 
of (4.47) is small enough. This will be made clear in (4.50). Since some 
sequences of inputs equal to zero are periodically introduced let us assume 
that k > k* is the first time instant after k* that we apply the zero sequences 
defined below: 
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II 
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II 
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fei 
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fei 
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£4+2 = ■ 




U4+2 


34+2 
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£4+0 


Uk + 3 = 






etc. 









Uk+L-2 ~ 0 

Uk+L-l = —Bk+L-l i—k+L— 1 Y k+L-l + S-k+L-l V k+L-2 ) 
Uk+L = 0 

U k+L + 1 = -B-k+L+\(—k+L+l Y k+L + 1 + &k+L+l U k+l) 
U k+L+ 2 = -B-k+L+2(—k+L+2 Y k+L+2 + &k+L+2 U k+L+l) 

£ 4 + 1+3 = etc - 



(4.48) 



The convergence analysis will be carried out by contradiction. We will 
assume that the regressor vector fa goes to infinity and prove that this leads 
to a contradiction. Note that if ||<^|| — » oo then from (4.26) and (4.36) it 
follows that || w£|| — * 0. 

Introducing the control strategy (4.48) into (4.47) we obtain 

fell! 

l + ll U R _, 0 0 Y~ J 



0 



(4.49) 
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Note from (4.48) that U k _ ] = —B^ x (D k _ x Y k _ , ) . Therefore U k _ l is bounded 
by Y k _ ] . Introducing this fact into (4.49) 



II 

1 + 1! > A , I • 

or equivalently 

l|liE + ill<e(l + ll +£-ill) 

where e > 0 is arbitrarily small. 

Using equation (4.11), (4.51) and the fact that U k = 0, one has 

INI + ll^'llll^+ill + C + ^ll +£-,11 +c 



(4.50) 

(4.51) 



(4.52) 



Hereafter to simplify notation, ( will generically represent bounded terms. ( 
may denote the bound for the system noises or the bound for different matrix 
norms. We will not give the exact expression for £ because it is irrelevant for the 
analysis purposes. Combining (4.52 ) and (4.3), we have 

||^fc + i || < eCll +£-i II + C (4-53) 



From (4.3) we get 



x k+2 ~ A " x k + 1 + ^^£+1 + 



k + 1 



(4.54) 



where U k+] is given in (4.48) 



|^ + i|| = 



— £+1 (— £+1 U k + — £+1 +£+i) 



-B, \ D, i , Y, 



—k+l—k+l 1 £+1 



(4.55) 



< £ 



—£+!—£+ 1 



(1 + || +£_, ||) (see(4.51)) 



Consequently one has (see (4.53), (4.54), (4.55)): 

1 1 *£+2 1 1 < £ CII +£-i II + C (4.56) 

We may proceed in the same way to obtain an expression for | x k+3 1 1 . 
Consider the following expressions from (4.3), (4.48), (4.10) respectively: 

X £+3 = +" x £+2 + CU k+2 + C'V' k+2 

U k +2 = — — £+2 (—£+2 ^£+i + D-k +2 +£+ 2 ) 

+£ + 2 = Ox.k +l + GU k+l + G'v' k+l + IV” +{ 



(4.57) 

(4.58) 

(4.59) 
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From the above and using (4.53) and (4.55), we obtain: 



Y k+2 < Ox k+l 



GU, 



k+\ 



+ c 



< <11 Yk -\ II + C 



(4.60) 



From the above and (4.55)-(4.58) one has: 

IN+*|[<^ 3 || F^H + C (4.61) 

Introducing (4.48) and (4.59), one has 

||*fc+aff < eh || x k _ 2 \\ +C (4.62) 

It should be noted that h 3 is a finite quantity which, for a given system, 
depends only on the length of the time interval [k — 2, k + 3] . We may extend 
this procedure over the interval [k — 2, k — 2 + L] . Then (4.62) becomes: 

ll-Xfc-2+J < <l|| *£—211 + C (4-63) 



where bx is a quantity similar to b 3 in (4.62). Finally we may repeat the same 
procedure for iL , / = 1,2,..., in order to obtain an expression for | x k _ 2+jL \ 



H-2+iL < eb L 



x k-2+(i-\)L 



+ c 



(4.64) 



Expressions (4.63) and (4.64) can be combined to obtain: 



Xc 



k~2+iL | 



< {eb L y \\x k _ 



{eb L y C + ( e ^i) , C + • • ■ + + C (4.65) 



Factor [(ehz.)' -1 ^ + ( ebL)' 2 C, + . . . + eb lC, + C] will be a sum of an infinite 
number of terms. But as sh/_ <C 1 this sum will be bounded. Hence we will have 
a bounded decreasing state ||.*T_ 2+ ,x|| as follows: 

||*£-2+/x|| < (£^)'||*£_ 2 || + C V/ = 1, . . . (4.66) 



From (4.66) we conclude that the state x k _ 2+iL for i= 1 , . . . , oo is bounded in 
the limit by a term that depends on the noise. The states at other time instants 
inside the intervals [k — 2 + (i — 1)L, k — 2 + iL] can be proven to be bounded 
too, by using the procedure described in (4.56) up to (4.62). This clearly 
contradicts our assumption that 0 diverges. Therefore all the signals remain 
bounded. Furthermore in view of the previous analysis, the smaller the noise 
upper bound, the smaller the state becomes in the limit. 



4.5 Simulations 

4.5.1 System without noise 

The performance of the proposed adaptive control scheme is illustrated in this 
section through a series of simulations. The following non-minimum phase 
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system is considered: 

x t +i 




f 


Xt + 


1 


0_ 




— 1.5 



y t = [1 0]x t + v" 



(4.67) 



The system’s poles and zeroes are respectively (1, 2) and 1.5. In this way 
matrices B, B' , G, D are: 





'1.5 1 ' 


, & = 


'-10.5 


2.5' 




o 

o 




'-14 


15' 


B = 


.2.5 1.5. 


.-22.5 


4.5. 


, G = 


.1 0 . 


, D = 


.-30 


31. 





(4.68) 



The initial parameter estimates B 0 , B' n , Go, D 0 are chosen in such a way 
that the signs of the determinants of Bq and B are opposite. This will lead to 
important conclusions concerning the utility of the modification of the param- 
eters’ estimates. So the initial parameter estimates are: 

r 2 1 —10.5 2.5 0 0 -14 151 

0 0 = 4.69 

[2.5 1.5 -22.5 4.5 1 0 -30 31 J v 7 

and initial conditions are: zero control input and state xo = [100, 100] 7 for 
t € [0, 8]. The parameters’ modification was used only when the determinant of 
the estimate of B was below a threshold of 0.1. 

It is clear that (3k in (4.32) can be multiplied by a scalar without losing the 
convergence properties. Such a coefficient can be used to reduce the abrupt 
changes in the modified parameter estimates and therefore improve the 
transient behaviour. In our case (3 in (4.32) was multiplied by a coefficient 
equal to 0.001. The initial value of the covariance matrix is chosen as 
P = 220/8, whereas there is no noise corrupting the system. On the other 
hand sequences of zero inputs have not been used in the simulations that 
follow. Figures that are referred hereafter are given in the appendix. 

Figures 4.1 and 4.2 show that modifying the parameter estimates helps to 
avoid unboundedness of the plant signals y(t) and u(t). The results that are 
presented in Figure 4.2 clearly denote that at around k = 22 or t = 44 there is a 
value of det(B) smaller than 0.1 in the case of unmodified parameters. The 
values of det(B) that follow are all very close to zero, a fact that leads to the 
behaviour shown in Figure 4.1. On the other hand, when the modification of 
the paprameters estimates is applied, the critical value of det(B) at t = 44 is 
avoided and the evolution of det(B) thereafter is clearly away from zero. 



4.5.2 System with noise 

Simulation results are now presented in the case when the system is corrupted 
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Figure 4.1 Up: WITHOUT modification, Down: WITH modification of parameter 
estimates 
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estimates. Note t = 2k 
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Figure 4.3 Up: WITHOUT modification, Down: WITH modification of parameter 
estimates 




Figure 4.4 O: WITHOUT modification, *: WITH modification of parameter B esti- 
mate. Note: t = 2k 
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with noise. System (4.67) remains the same with the exception of 
b' = b = [1 — 1.5] 7 in (Figure 4.1) and of a sinusoidal noise described by: 

vf, = 0.0004 sin (2k) + 0.0003 cos (2k) (4.70) 

v", = 0.0004 sin (2k) - 0.0003 cos (2k) (4.71) 

Initial conditions are the same as in the ideal case (i.e. zero control input and 
state xo = [100 100] 7 until t = 8) and P = 200/8. Figure 4.3 shows the 
evolution of output and input signals and Figure 4.4 explains how values of 
determinant of B estimates close to zero can lead to huge output and input 
signals. 



4.6 Conclusions 

This chapter has presented an indirect adaptive periodic control scheme based 
on the lifted representation of the plant proposed in [3]. New arguments have 
been presented so that this technique appears in a better position as a solution 
to the long standing problem of singularities in adaptive control of non- 
minimum phase plants. Contrary to other techniques, the only a priori 
knowledge required on the plant is its order besides the standard controll- 
ability/observability assumption. As compared to previous studies in the 
subject [3], [14], the proposed controller does not require the frequent 
introduction of periodic n-length sequences of zero inputs. Furthermore 
simulations have shown that in practice there is no need to introduce sequences 
of inputs equal to zero. Therefore the new controller is such that the system 
always operates in closed loop which leads to better performance character- 
istics. Simulation results have also shown that the use of the proposed estimate 
modification can significantly reduce signal peaks during the transient period 
even in the case when signals y(t) and u(t) of the original unmodified system do 
not become unbounded. 
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Adaptive stabilization of 
uncertain discrete-time 
systems via switching 
control: the method of 
localization 

P. V. Zhivoglyadov, R. H. Middleton and M. Fu 



Abstract 

In this chapter a new systematic switching control approach to adaptive 
stabilization of uncertain discrete-time systems with parametric uncertainty is 
presented. Our approach is based on a localization method which is concep- 
tually different from supervisory adaptive control schemes and other existing 
switching adaptive schemes. Our approach allows for slow parameter drifting, 
infrequent large parameter jumps and unknown bound on exogenous dis- 
turbance. The unique feature of localization based switching adaptive control 
distinguishing it from conventional adaptive switching schemes is its rapid 
model falsification capabilities. In the LTI case this is manifested in the ability 
of the switching controller to quickly converge to a suitable stabilizing 
controller. We believe that the approach presented in this chapter is the first 
design of a switching controller which is applicable to a wide class of linear 
time-invariant and time-varying systems and which exhibits good transient 
performance. The performance of the proposed switching controllers is 
illustrated by many simulation examples. 



5.1 Introduction 

Control design for both linear and nonlinear dynamic systems with unknown 
parameters has been extensively studied over the last three decades. Despite 
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significant advances in adaptive and robust control in recent years, control of 
systems with large-size uncertainty remains a difficult task. Not only are the 
control problems complicated, so is the analysis of stability and performance. 

It is well known [12,14] that classical adaptive algorithms prior to 1980 were 
all based on the following set of standard assumptions or variations of them: 

(i) An upper bound on the plant order is known. 

(ii) The plant is minimum phase. 

(iii) The sign of high frequency gain is known. 

(iv) The uncertain parameters are constant, and the closed loop system is free 
from measurement noise and input/output disturbances. 

Classical adaptive algorithms are known to suffer from various robustness 
problems [34], A number of attempts have been made since 1980 to relax the 
assumptions above. A major breakthrough occurred in the mid-1980s [17, 21, 
35] for adaptive control of LTV plants with sufficiently small in the mean 
parameter variations. Later attempts were made for a broader class of systems. 
Fast varying continuous-time plants were treated in [36], assuming knowledge 
of the structure of the parameter variations. By using the concept of 
polynomial differential (integral) operators the problem of model reference 
adaptive control was dealt with in [32] for a certain class of continuous-time 
plants with fast time-varying parameters. An interesting approach based on 
some internal self-excitation mechanism was considered in [7] for a general 
class of LTV discrete-time systems. The global boundedness of the state was 
proved. However, it must be noted that the presence of such self-excitation 
signals in a closed loop system is often undesirable. 

In another research line, a number of switching control algorithms have been 
proposed recently by several authors [2, 6, 8, 20, 23, 24, 31], thus significantly 
weakening the assumptions in (i)-(iv). Both continuous and discrete linear 
time-invariant systems were considered. Research in this direction was 
originated by the pioneering works of Nussbaum [31] and Martensson [20]. 
Nussbaum considered the problem of finding a smooth stabilizing controller 

( z(t) =f(g(t),z(t)) 

\u(t) = g(y(t),z(t)) 

for the one-dimensional system 

( x(t) = ax(t)+qu(t) 
l y{t) = x{t) 

with both q ^ 0 and a>0 unknown. In [31] Nussbaum describes a whole 
family of controllers of the form (5.1) which achieve the desired stabilization of 
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the system (5.2). For example, it was shown that every solution (x(t),z(t)) of 
x = ax + qx[z 2 + 1) cos(7rz/2) exp z 2 



z = x(z 2 + 1) 



(5.3) 



has the property that lim,^,*, x(t) = 0 and lim^oo z{t) exists and is finite. We 
note that the structure of the adaptive controller is explicitly seen from (5.3). 
Another important result proved in [31] is that there exists no stabilizing 
controller for the plant (5.2) expressed in terms of polynomial or rational 
functions. A more general result was presented by Martensson [20]. In 
particular, it was shown in [20] that the only a priori information which is 
needed for adaptive stabilization of a minimal linear time-invariant plant is the 
order of a stabilizing controller. This assumption can even be removed if a 
slightly more complicated controller is used. Consider the following dynamic 
feedback problem. Given the plant 



and the controller 



x = Ax + Bit, x G R", u G R m , 
y = Cx, y G IF 

f z = Fz + Gy, z G R / 

1 u = Hz + Ky 



(5.4) 



(5.5) 



where m, r are known and fixed, and n is allowed to be arbitrary. It is easy to 
see that this is equivalent to the static feedback problem 



(5.6) 



where x = ( x T z T ) T , u = (u T z T ) T , y = ( y T z T ) T and A, B , C, and K are matrices 
of appropriate dimensions. Let the regulator be 




u = g(h(k))N(h(k))y 



k = 



(5.7) 



where N(h) is an ‘almost periodic’ dense function and h and g are continuous, 
scalar functions satisfying a set of four assumptions (see [20] for more details). 
Martensson’s result reads: ‘Assume that / is known so that there exists a fixed 
stabilizing controller of the form (5.5), and that the augmentation to the form 
(5.6) has been done. Then the controller (5.7) will stabilize the system in the 
sense that 



(x(t), z(t), k(t)) — > (0, 0, koo) as t — > oo 



(5.8) 



where k ^ < oo’. 
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One such set of functions given by Martensson is 

h(k) = (log A:) 1 / 2 , k > 1, g{h) = (sin /z 1/2 + l)/? 1 / 2 (5.9) 

Martensson’s method is based on a ‘dense’ search over the control parameter 
space, allows for no measurement noise, and guarantees only asymptotic 
stability rather than exponential stability. These weaknesses were overcome 
in [8] where a finite switching control method was proposed for LTI systems 
with uncertain parameters satisfying some mild compactness assumptions. 
Different modifications of Martensson’s controller aimed at achieving 
Lyapunov stability, avoiding dense search procedures, as well as extending 
this approach to discrete-time systems have been reported recently (see, e.g., [2, 
8, 19, 23]). However, the lack of exponential stability might result in poor 
transient performance as pointed out by many researchers (see, e.g., [8,19] for 
simulation examples). Below we present a simple example of a controller based 
on a dense search over the parameter space. This controller is a simplified 
version of that presented in [19]. 

Example 1.1 The second order plant 

x(t + 1) = aix(t) + aix(t — 1) + bu(i) + £(f), x,«eR (5.10) 

with ci\ 2 G R, b ^ 0 being arbitrary unknown constants and sup ; >- |£(t)| < oo 
has to be controlled by the switching controller 

u(t) = k(t)x(t) (5-11) 

where k(0) = /z(l) and k{t) = h(i), t G (/,-, /,■+ 1 ] and h{i) is a function dense in R 
defined so that it successively looks at each interval [—p, p], p G N and tries 
points l/2 p apart, namely, 

A(l) = l h{ 4) = -0.5 h{ 7) = 1.75 

h{ 2) =0.5 h{5) = — 1 etc. 

h{ 3) = 0 h{ 6) = 2 



The system performance is monitored using a function 

T(l) = } |x( f,-_ ! ) | + (5.12) 



For each i > 1 such that t,_i ^ oo, the switching instant is defined as 



u — 



min{? : t> f,_i, |x(f)| >M(?,_i)/3(t,_i) ( ' fi) |x(fi-i)| + K0-i)} 
oo otherwise 



if this exists 
(5.13) 



where 0 < M(t), 0 < (3{t) < 1 and 0 < u(t) are strictly positive increasing 
functions satisfying the following conditions lirn^oo M(t) = + 00 , 
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Figure 5.1 Example of a dense search 



Hindoo p{t) = 1, lim^oo v(t) = + oo. The behaviour of the closed loop system 
with ci] = —2.2, a 2 = 0.3 and b — 1 is illustrated in Figure 5.1(a)-(b). 

A different switching control approach, called hysteresis switching, was 
reported in a number of papers [22, 27, 37] in the context of adaptive control. 
In these papers, the hysteresis switching is used to swap between a number of 
‘standard’ adaptive controllers operating in regimes of the parameter space. 
The switching, in these cases, is used to avoid the ‘stabilizability’ problem in 
adaptive controllers. 

Conventional switching control techniques are all based on some mechanism 
of an exhaustive search over the entire set of potential controllers (either a 
continuum set [20] or a finite set [8]). A major drawback is that the search may 
converge very slowly, resulting in excessive transients which renders the system 
‘unstable’ in a practical sense. This phenomenon can take place even if the 
closed loop system is exponentially stable. To alleviate this problem, several 
new switching control schemes have been proposed recently. The so-called 
supervisory control of LTI systems for adaptive set-point tracking is proposed 
by Morse [25, 26] to improve the transient response. A further extension of 
Morse’s approach is given in [13]. A very similar, in spirit, supervisory control 
scheme for model reference adaptive control is analysed in [29], The main idea 
of supervisory control is to orchestrate the process of switching into feedback 
controllers from a pre-computed finite (continuum) set of fixed controllers 
based on certain on-line estimation. This represents a significant departure 
from traditional estimator based tuning algorithms which usually employ 
recursive or dynamic parameter tuning schemes. This approach has apparently 
significantly improved the quality of regulation, thus demonstrating that 
switching control if properly performed is no longer just a nice theoretical 
toy but a powerful tool for high performance control systems design. However, 
several issues still remain unresolved. For example: 
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(i) a finite convergence of switching is not guaranteed. This aspect is 
especially important in situations when convergence of switching is 
achievable. It seems intuitively that in adaptive control of a linear time- 
invariant system it is desirable that the adaptive controller ‘converges’ to a 
linear time-invariant controller; 

(ii) the analysis of the closed loop stability is quite complicated and often 
dependent on the system architecture. Without a simpler proof and better 
understanding of the ‘hidden’ mechanisms of supervisory switching 
control its design will remain primarily a matter of trial and error. 

In this chapter, we present a new approach to switching adaptive control for 
uncertain discrete-time systems. This approach is based on a localization 
method, and is conceptually different from the supervisory control schemes 
and other switching schemes. The localization method was initially proposed 
by the authors for LTI systems [39]. This method has the unique feature of fast 
convergence for switching. That is, it can localize a suitable stabilizing 
controller very quickly, hence the name of localization. Later this method 
was extended to LTV plants in [40]. By utilizing the high speed of localization 
and the rate of admissible parameter variations exponential stability of the 
closed loop system was proved. The main contribution of this chapter is a 
unified description of the method of localization. We show that this method is 
also easy to implement, has no bursting phenomenon, and can be modified to 
work with or without a known bound on the exogenous disturbance. 

To highlight the principal differences between the proposed framework and 
existing switching control schemes, in particular, supervisory switching control, 
we outline potential advantages of localization based switching control: 

(i) The switching controller is finitely convergent provided that the system is 
time invariant. Depending on how the switching controller is practically 
implemented the absence of this property could potentially have far 
reaching implications. 

(ii) Unlike conventional switching control based on an exhaustive search over 
the parameter space, the switching converges rapidly thus guaranteeing a 
high quality of regulation. 

(iii) The closed loop stability analysis is comparatively simple even in the case 
of linear time-varying plants. This is in sharp contrast to supervisory 
switching control where the stability analysis is quite complicated and 
depends on the system architecture. 

(iv) Localization based switching control is directly applicable to both linear 
time-invariant and time-varying systems. 

(v) The localization technique provides a clear understanding of the control 
mechanism which is important in applications. 

The rest of this chapter is organized as follows. Section 5.2 introduces the class 
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of LTI systems to be controlled and states the switching adaptive stabilization 
problem. Two different localization principles are studied in Sections 5.3 and 
5.4. We also study a problem of optimal localization, which allows us to obtain 
guaranteed lower bounds on the number of controllers discarded at each 
switching instant and adaptive stabilization in the presence of unknown 
exogenous disturbance. Simulation examples are given in Section 5.5 to 
demonstrate the fast switching capability of the localization method. 
Conclusions are reached in Section 5.6. 



5.2 Problem statement 

We consider a general class of LTI discrete-time plants in the following form: 

D(z~ x )y(t) = N(z~ x )u(t) + £(t — 1) +V(t~ 1) (5-14) 

where u{t) is the input, y(t) is the output, £(f) is the exogenous disturbance, q{t) 
represents the unmodelled dynamics (to be specified later), z _ 1 is the unit delay 
operator: 

N{z~ l ) = «iz~' + niz^ 2 + . . . + n„z~" (5.15) 

D(z~ l ) = 1 + diz~ l + . . . + d n z~ n (5.16) 

Remark 2.1 By using simple algebraic manipulations, measurement noise and 
input disturbance are easily incorporated into the model (5.14). In this case, 
y(t), u(t), and £(/) represent the measured output, computed input and 
(generalized) exogenous disturbance, respectively. For example, if a linear 
time-invariant discrete-time plant is described by 

y{ z ) = ^pry M z ) + d ( z )) + ?( z ) 

where d(z) and q(z) are the input disturbance and plant noise, respectively, the 
plant can be rewritten as 

D(z _1 )y(z) = N(z~')u(z) + ( N(z~ l )d(z ) + Z^r -1 )^ -1 )) 

Consequently, the exogenous input £(z) is N(z~ l )d(z) + D(z~ l )q(z~ l ). 

We will denote by 6 the vector of unknown parameters, i.e. 

9= («„,... ,n 2 ,-d„,...,-d u ni) T (5.17) 

Throughout the chapter, we will use the following nonminimal state-space 
description of the plant (5.14): 

x(t + 1) = A(0)x(t) + B(9)u(t ) + E(£(t) + r/(t)) 



(5.18) 
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We also define the regressor vector 




Then, (5.14) can be rewritten as 

y (t) = e T (t>(t - i) + £(t - i) + v(t - i) 



(5.21) 

(5.22) 

(5.23) 



The following assumptions are used throughout this section: 

(Al) The order n of the nominal plant (excluding the unmodelled dynamics) is 
known. 

(A2) A compact set f l € R 2 ", is known such that 9 € ft. 
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(A3) The plant (5.14) without unmodelled dynamics (i.e. rj(t) = 0) is stabiliz- 
able over Q. That is, for any 8 € fi, there exists a linear time-invariant 
controller C{z~ { ) such that the closed loop system is exponentially stable. 
(A4) The exogenous disturbance £ is uniformly bounded, i.e. for all to G N 

sup|£(i)|<£ (5.24) 

t>t 0 

for some known constant £. 

(A5) The unmodelled dynamics is arbitrary subject to 

MO I < h(0 = e SU P cr,_A ll x (^)ll (5.25) 

0 <k<t 

for some constants £ > 0 and 0 < cr < 1 which represent the ‘size’ and 
‘decay rate’ of the unmodelled dynamics, respectively. 

Remark 2.2 Assumption (Al) can be relaxed so that only an upper bound 
« max is known. Assumption (A4) will be used in Sections 5. 3-5. 4 and will be 
relaxed to allow £ to be unknown in Sections 5.3.2 and 5.4.1 where an 
estimation scheme is given for £. 

Remark 2.3 We note that the assumptions outlined above are quite standard 
and have been used in adaptive control to derive stability results for systems 
with unmodelled dynamics (see, e.g., [7, 16, 21, 30] for more details). 

The switching controller to be designed will be of the following form: 

u{t) = K i{t) x(t) (5.26) 

where is the control gain applied at time t, and i(t) is the switching index at 
time t, taking value in a finite index set I. The objective of the control design is 
to determine the set of control gains 

Ki = {K,,i€l} (5.27) 

and an on-line switching algorithm for i(t) so that the closed loop system will 
be ‘stable’ in some sense. 

We note that switching controllers can be classified according to the logic 
governing the process of switching. Here are some typical examples. 



7. Conventional switching control 
The switching index is defined as 



(i(t- 1) if G, < 0 
\ i(t — 1) + 1 otherwise 



(5.28) 



where G, is some appropriately chosen performance index. This type of 
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switching control is finitely convergent and based on an exhaustive search over 
the parameter space (see, e.g., [8], [9]). 



2 Supervisory switching control 
The switching index is defined as 

K*- !) 

arg min,. e/ 



i(t) = 



\ e i(t)\ 



if t — s(t) < td 
otherwise 



(5.29) 



where s(t) is the time of the most recent switching, td is a positive dwell time, 
and ej(t'), Mi £ I is a weighted prediction error computed for the z'th nominal 
system. This type of switching control has been extensively studied recently by 
a number of researchers (see, e.g., [25, 26]). The proof of the closed loop 
stability in this case is not dependent on finite convergence of the switching 
process, furthermore, supervisory switching control is not finitely convergent in 
general. 



5.3 Direct localization principle 



The switching algorithms to be used in this section are based on a localisation 
technique. This technique, originally used in [39] for LT1 plants, allows us to 
falsify incorrect controllers very rapidly while guaranteeing exponential stab- 
ility of the closed loop system. In this section, we describe a direct localization 
principle (see, e.g., [40]) for LTI plants which is slightly different from [39] but 
is readily extended to LTV plants. The main idea behind this principle consists 
of simultaneous falsification of potentially stabilizing controllers based ex- 
plicitly on the model of the controlled plant. That implies the use of some effective 
mechanism of discarding controllers inconsistent with the measurements. 

The specific notion of stability to be used in this section is described below. 

Definition 3.1 The system (5.14) satisfying (A1)-(A5) is said to be globally £- 
exponentially stabilized by the controller (5.26) if there exist constants M\ > 0, 
0 < p < 1, and a function TF?)-) : R + — > R + with = 0 such that 

I WOH < MjW^IWWII + m 2 (0 (5.30) 

holds for all to > 0, x(?o). £ > 0, and £(•) and r/(-) satisfying (A4)-(A5), 
respectively. 

The definition above yields exponential stability of the closed loop system 
provided that £ = 0 and exponential attraction of the states to an origin 
centred ball whose radius is related to the magnitude of the exogenous 
disturbance. 

First, we decompose the parameter set fl to obtain a finite cover 
which satisfies the following conditions: 
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(Cl) O, C fi, i = l, . . .,L. 

(C2) Uf=i fi/ = fi- 

(C3) For each i = 1, . . . ,L, let 6j and r,- > 0 denote the ‘centre’ and ‘radius’ of 
Qj, i.e. 9j G f ii and ||0 — 0,|| < r,- for all 6 & f l,-. Then, there exist K u 
i = l, ... ,L, such that 

|A max (zl(0) + B(0)Ki)\ < 1, V 110-0/11 <r u i=l,...,L. (5.31) 

Conditions (C1)-(C2) basically say that the uncertainty set fi is presented as a 
finite union of non-empty subsets while condition (C3) defines each subset fl,- as 
being stabilizable by a single LTI controller K t . It is well known that such a 
finite cover can be found under assumptions (A1)-(A3) (see, e.g., [8, 24, 25] for 
technical details and examples). More specifically, there exist (sufficiently large) 
L, (sufficiently small) r,-, and suitable K,,i = 1 such that (C1)-(C3) hold. 

Leaving apart the computational aspects of decomposing the uncertainty set 
satisfying conditions (Cl) (C3) we just note that decomposition can be 
conducted off-line, moreover, some additional technical assumptions (see, 
e.g., (C3') below) make the process of decomposing pretty trivial. The 
computational complexity of decomposing the uncertainty set, in general, 
depends on many factors including the ‘size’ of the set, its dimension and 
‘stabilizability’ properties, and has to be evaluated on a case-by-case basis. 

The key observation used in the localization technique is the following fact: 
given any parameter vector 6 € fly and a control gain K^ t ) for some 
i{t),j = 1, . . . , L. If i(t) = j, then it follows from 

y(t) = 6 T (j){t — 1) + £(t — i) + r/(t — i) (5.32) 

that 

i) -y(t ) I < r j\\<f>(t - 1)|| + £ + fj{t- 1) (5.33) 

This observation leads to a simple localization scheme by elimination: If the 
above inequality is violated at any time instant, we know that the switching 
index i(t) is wrong (i.e. i(t) ^ j), so it can be eliminated. In identification theory 
this concept is sometimes referred to as falsification; see, e.g., a survey [15] and 
references therein. The unique feature of the localization technique comes from 
the fact that violation of (5.33) allows us not only to eliminate i(t) from the set 
of possible controller indices, but many others. This is the key point! As a 
result, a correct controller can be found very quickly. 

We now describe the localization algorithm. Let I(t) denote the set of 
‘admissible’ control gain indices at time t and initialize it to be 

/(*„) = {1,2,..., L} (5.34) 

Choose any initial switching index i(to) G /(to)- For t > to, define 

7(0 = {j: (5.33) holds, j=l,...,L} (5.35) 

Then, the localization algorithm is simply given by 
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I(t) = /(/- 1) n I(t), Mt > t 0 (5.36) 

The switching index is updated by taking 1 

i{i) = { ~ ^ if t > t o and i(t - 1) € I{t) 

\ any member of I(t) otherwise 

A simple geometrical interpretation of the localization algorithm (5.36) is given 
in Figure 5.2. One possible way to view the localization technique is to interpret 
it as family set identification of a special type, that is, family set identification 
conducted on a finite set of elements. Interpreted in this way the localization 
technique represents a significant departure from traditional family set 
identification ideas. Either strip depicted in Figure 5.2 contains only those 
elements which are consistent with the measurement of the input/output pair 
{y(t),u(t — 1)}. The high falsifying capability of the proposed algorithm 
observed in simulations can informally be explained in the following way. 
Let the index i(t) be falsified, then the discrete set of elements { 6 ,■ : i e /(/)} 
consistent with all the past measurements is separated from the point ht) b y 
one of the hyperplanes 

1) = y{t) + rj\\4>{t - 1)11 +£ + fj(t- 1) (5.38) 

or 

1) = y(t) ~ rj\\<t>{t - 1)11 1) (5.39) 

dividing the parameter space into two half-spaces. It is also clear that every 
element belonging to the half-space containing the point 9^ t ) is falsified by the 
algorithm of localization (5.36) at the switching instant t. We note that 
the rigorous analysis of the problem of optimal localization conducted in 
Section 5.3.1 allows us to derive a guaranteed lower bound on the number of 
controllers falsified at an arbitrary switching instant. A different non-identifica- 
tion based interpretation of localization can be given in terms of the prediction 
errors ej = \6j<f>(t— 1) — j(f)|, j = 1,2, ...L computed for the entire set of 
‘nominal’ models. Thus, any model giving a large prediction error is falsified. 
The following technical lemma describes the main properties of the algorithm 
of localization (5.36). 

Lemma 3.1 Given the uncertain system (5.14) satisfying assumptions (Al)- 
(A5), suppose the finite cover {12,-}^ of 12 satisfies conditions (C1)-(C3). Then, 
the localization algorithm given in (5.34)-(5.37) applied to an LTI plant (5.14) 
possesses the following properties: 

(i)/(0^{|, Vr > t Q . 

1 In fact, we will see in Section 5.3.1 that there may be "clever’ ways of selecting i(t) 
when i(t — 1) is falsified. 
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PARAMETER SPACE 




(ii) There exists a switching index j e I(t) for all t > to such that the closed 
loop system with u(t) = Kj.x(t) is globally exponentially stable. 

Proof The proof is trivial: suppose the parameter vector 9 for the true plant is 
in 0/ for some j € {1, . . . , L}. Then, the localization algorithm guarantees that 
j € I{t ) for all t. Hence, both (!) and (ii) hold. 

To guarantee exponential stability of the closed loop system, we need a 
further property of the finite cover of H. To explain this, we first introduce the 
notion of quadratic stability [3]. 

Definition 3.2 A given set of matrices {A(9) : 9 € 0} is called quadratically 
stable if there exist symmetric positive-definite matrices H, Q such that 

A t (9)HA(9) — H < —Q, Vflefi (5.40) 

It is obvious that the finite cover of fl can always be made such that 
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each flj is ‘small’ enough for the corresponding family of the ‘closed-loop’ 
matrices {A^O) + B(0)Kj : 9 e fi,} to be quadratically stable with some K t . 

In view of the observation above, we replace the condition (C3) with the 
following: 

(C3') For each i = 1, . . . ,L, let dj and r,- > 0 denote the ‘centre’ and ‘radius’ of 
Qj, i.e. 9j G Qj and \\6 — 6j\\ < r, for all 6 € ff . Then, there exist control 
gain matrices K t , symmetric positive-definite matrices Hi and Q,, 
i = 1 .... ,L, and a positive number q such that 

(A{9) + B(9)Kj) T Hj(A(9) + B{9)Ki) - H t < -Q h 

5.41 

V \\9 - 0 f || <(r i + q), i = 1, ... ,L 

Remark 3.1 Condition (C3') requires that every subset fi,- obtained as a result 
of decomposition be quadratically stabilized by a single LTI controller. We 
also note that a finite cover which satisfies (C1)-(C2) and (C3') is guaranteed to 
exist. Moreover, Condition (C3') translated as one requiring the existence of a 
common quadratic Lyapunov function for any subset fi, further facilitates the 
process of decomposition. 

The following theorem contains the main result for the LTI case: 

Theorem 3.1 Given an LTI plant (5.14) satisfying assumptions (A1)-(A5). Let 
{fl ( }| = i be a finite cover of f l satisfying conditions (C1)-(C2) and (C3')- Then, 
the localization algorithm given in (5.34)— (5.37) will guarantee the following 
properties when e (i.e. the ‘size’ of unmodelled dynamics) is sufficiently small: 

(i) The closed loop system is globally ^-exponentially stable, i.e., there exist 
constants M\ > 0, 0 < p < 1, and a function Mi{-) : R+ — > R + , 
M 2 (0) = 0 such that 

I W0II < M lP ^\\x(t 0 )\\ + M 2 (0 (5.42) 

holds for all t > to and x(to). 

(ii) The switching sequence {/(to), i(to + 1), . . .} is finitely convergent, i.e. 
/(/) = const, V / > t* for some t*. 

Proof See Appendix A. 

The proof of the theorem presented in Appendix A is based on the 
observation that between any two consecutive switchings the closed loop 
system behaves as an exponentially stable LTI system subject to small 
parametric perturbations and bounded exogenous disturbance. This is the 
key point offering a clear understanding of the control mechanisms. 

It follows from the proof of Theorem 3.1 that the constant M\ in the bound 
(5.42) is proportional to the total number of switchings made by the controller 
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while the parameter p is dependent on the ‘stabilizability’ property of the 
uncertainty set Q. This further emphasizes the importance of fast switching 
capabilities of the controller for achieving good transient performance. 



5.3 . 7 Optimal localization 

The localization scheme described above allows an arbitrary new switching 
index in /(f) to be used when a switching occurs. That is, when the previous 
switching index if — 1) is eliminated from the current index set /(f), any 
member of /(f) can be used for /(f). The problem of optimal localization 
addresses the issue of optimal selection of the new switching index at each 
switching instant so that the set of admissible switching indices /(f) is 
guaranteed to be pruned down as rapidly as possible. The problem of optimal 
localization is solved in this section in terms of the indices of localization 
defined below. In the following for notational convenience we drop when 
possible the index f from the description of the set of indices /(f). Also we make 
the technical assumption 

(A7) fj = Vj = r , V/, / = 1,2 ,L, and some r > 0. 

For any set / C {1, 2, . . . , L}, 0 = {$,- : i e /}, a fixed j € / and any r^O, 
z e R 2 ", define the function 

A(zj, 0) = |{0 f : & - 9j) t z >0, i e /} | (5.43) 

where | • | denotes the cardinal number of a set. Then 

ind(0y, 0) = min A (z,j, 0) (5-44) 

will be referred to as the index of localization of the element 0j with respect to 
the set 0. 

Lemma 3.2 The index of localization ind(0,-, 0) represents a guaranteed lower 
bound on the number of indices discarded from the localization set /(f) at the 
next switching instant provided that u(t) = Kjx(t). 

Proof Without loss of generality we assume that (f + 1) is the next switching 
instant, and controller Kj is discarded. From (5.35) we have /(f + 1), 
equivalently 

9] 4>{t) > y{t + 1) + (o + 4)110(011 + £ + + 1) (5-45) 

ejm <y{t+ 1) - (0 + 4)11^(011 1) (5-46) 

Taking z= -</>(f)/||<^(f)|| for (5.45), or z = ^(f)/||^(f)|| for (5.46) and using 
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(5.43) we see that there are A (z,j, 0) number of controller indices which do not 
belong to I(t+ 1). We note that f{t ) f 0, because otherwise it is easy to see 
from (5.23) that there exists no element 6j € 0 satisfying (5.45) or (5.46), and, 
consequently, switching is not possible. Since ind(0/,0) < A(zj, 0), we con- 
clude that there are at least ind(0 7 , 0) number of controllers to be discarded at 
the switching instant (?+ 1). 

In terms of (5.44) the index of localization of the discrete set 0 is defined as 
ind 0 = max{ind(6(', 0) : j € /} (5-47) 



That is, ind 0, is the largest attainable lower bound on the number of 
controllers eliminated at the time of switching, assuming that the regressor 
vector can take any value. The structure of an optimal switching controller is 
described by 



u{t) = K i(t) x{t) 



(5.48) 



i{t) 



i(t- 1) if i(t- 1) e /(?); 

*o P t(0 = arg max / {ind(6> 7 , 0(?)) :j € I(t)} otherwise (5-49) 



The problem of optimal localization reduces to determining the optimal 
control law, that is, specifying the switching index i opt (t) at each time instant 
when switching has to be made. To solve this problem we introduce the notion 
of separable sets. 

Definition 3.3 Given a finite set 0 c R" and a subset / C 0; / is called a 
separable set of order k if 

(i) \J\ = k. 

(ii) co {/} p| co {0 — /} = {} where co {■} stands for the convex hull of a 
set. 

The main properties of separable sets are listed below: 

(a) A vertex of co {0} is a separable set of order 1. 

(b) The order of a separable set A'<|0|. 

(c) For each separable set J of order k , k > 1, there exists a set J' c J such 
that J' is a separable set of order (k — 1 ) . 

Proof (a), (b) are obvious. To prove (c), we note that for each separable set /, 
there exists a hyperplane P separating / and 0 — Let n be the normal 
direction of P. Move P along n towards J until it hits /. Two cases are possible. 

Case 1: One vertex is in contact. In this case move P a bit further to pass the 

vertex. The remaining points in J form J' . 
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Case 2: Multiple vertices are in contact. One can always change n slightly so 

that P still separates J and 0 — J, but there is only one vertex in contact with 
P, and we are back to Case 1. 

Lemma 3.3 Let & k be the set of all separable sets of order k and 

Ek = IW Jk - Then ’ 

ind 0=1+ arg max{k : at f 0} (5.50) 

k 

Proof Follows immediately from Definition 3.3 and the property of separable 
sets (c). Indeed, suppose that the index of localization satisfies the relation 

ind 0 = m > 1 + arg ma x{k : a k f 0} (5.51) 

k 

then there must exist an element 8j G 0, such that ind(0/, 0) = m, moreover 

Oji S m ~\ 0j G 0 - S'"' 1 (5.52) 

since otherwise, by definition of separable sets ind(6j / -, 0) < m — 1. But it 
follows from (5.51) that 0 — S'” -1 = {}. On the other hand by Definition 3.3 
and the properties of separable sets (b), (c) the index of localization of the set 0 
cannot be smaller than that given by (5.50). This concludes the proof. 

Denote by V(-) the set of vertices of co (•). The complete solution to the 
problem is given by the following iterative algorithm. 

Algorithm A 

Step 1 Initialize k = 1. Compute 0 1 = {{0} : 9 G F(0)}. 

Step 2 Set k = k + 1. Compute 

& k = {J k -\ U 9i : J k - 1 C ® k ~\ 6j G F(0 — Jk- i), Jk- 1 U 9j is separable} 

Step 3 If a k = 0, then ind 0 = k, and stop, otherwise go to Step 2. 

The properties of localization based switching control are summarized in the 
following theorem. Let sub}-} denote the set of subscripts of all the elements 

in { }• 

Theorem 3.2 

(i) The solution to the problem of optimal localization may not be unique 



and is given by the set 




Iopt = sub{© - S'"' 1 } 


(5.53) 


where 




m = ind 0=1+ arg max} A’ : a 1 ' f 0} 
k 


(5.54) 



(ii) For any £ > 0, e > 0, the total number of switchings / made by the 
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optimal switching controller (5.48), (5.49) applied to an LTI plant (5.14) 
satisfies the relation 

/-i 

ind Q(t p ) — 2 ^ L — 1 (5.55) 

p = o 

where t p , p = 0, 1, 1 denote the switching instants. 

Proof The proof of (i) follows directly from Lemma 3.3. To prove (ii) we note 
that 

|©(?i)| < L-ind 0(t o ); 

|0(r 2 )| < |0(d)| - ind 0(?i) < L — ind 0(t o ) — ind 0(?i) 

then 

i - 1 

|0(</)| <vi = L- ^2 ind 0(d) 

7—0 

Since vj > 1 the result follows. 

Algorithm A applied to an arbitrary localization set 0 indicates that except 
for a very special case, namely, {0j} jeI = F(0), localization with any choice of 
the switching index i(t) such that 9,y) £ F(0) will always result in elimination 
of more than one controller at any switching instant. This is a remarkable 
feature distinguishing localization based switching controllers from conven- 
tional switching controllers. Moreover, a simple geometrical analysis (see, e.g., 
Figure 5.2) indicates that for ‘nicely’ shaped uncertainty sets (for example, a 
convex fl) and large L the index of localization is typically large, that is, 
ind(0) >> 1 . Theorem 3.2 gives a complete theoretical solution to the problem 
of optimal localization formulated above in terms of indices of localization. 
Flowever, it must be pointed out that the search for optimality in general is 
involved and may be computationally demanding. To alleviate potential 
computational difficulties we propose one possible way of constructing a 
suboptimal switching controller. 

Algorithm B 

Step 1 Initialize k = 1. Compute T 1 = F(0). 

Step 2 Set k = k + 1. Compute 

r fc = r k ~ l u ^(©-r*- 1 ) 

Step 3 If T k = 0, then ind 0 ^ /r, cind stop, otherwise ^o to Step 2. 

Algorithm B allows for a simple geometrical interpretation, namely, at each 
step a new set T /c is obtained recursively by adding the set of vertices of 
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(0 — T^ 1 ). The simplicity of the proposed algorithm is explained by the fact 
that we no longer need to check the property of separability (see Step 2 in 
Algorithm A). 

The main property of the Algorithm B is presented in the following 
proposition. 

Proposition 3.3 The index of localization ind 0 satisfies the inequality 

ind 0 > 1 + arg max {k : T A f 0} (5.56) 

k 

Proof The proof is very simple and follows from the fact that for any 0 € 0, 
such that 8£ F(0) it is true that ind(0, 0) > 2. By applying this rule 
recursively we obtain (5.56). 

Example 3.1 To illustrate the idea of optimal (suboptimal) localization we 
consider a simple localization set 0 = depicted in Figure 5.3. 

We note that the point 0 5 is located exactly in the centre of the square 
(0i, 02, 04 , 03 ). Applying Algorithm A to the set 0 we obtain 

©' = m, {0s}, {0 4 }}, 

0 2 = {{0!, 0 2 }, {01, 03}, {02 , 04}, {03, 04}}, 

@ 3 = {{01, 02, 05}, {01, 03, 05}, {03, 04, 05}, {02, 04, 05}} 

Since U /g0 3 J = 0 we conclude that ind 0 = 3 and the optimal switching index 
is given by i(t) = 5. To compute a guaranteed lower bound on the index of 




05 

* 



03 04 

* * 



Figure 5.3 Example of optima / localization 
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localizaton ind 0 Algorithm B is used. We have 
r 1 = {01, 02, 03, 0 4 }, 
r = {01, 02, 03) 04) 05} = © 

therefore, ind 0 > 2. We note that in this particular example the optimal 
solution, that is, i(t) = 5, coincides with the suboptimal one. 

Remark 3.2 To deal with the problem of optimal (suboptimal) localization 
different simple heuristic procedures can be envisioned. For example, the 
following ‘geometric mean’ algorithm of computing a new switching index is 
likely to perform well in practice, though it is quite difficult in general to obtain 
any guaranteed lower bounds on the indices of localization. At any switching 
instant t we choose 

i(t)= arg min 1 1 0/ — ^ 0;/©(OIII (5-57) 

1 



5.3.2 Localization in the presence of unknown disturbance bound 
In this section we further relax assumption (A4) to allow the disturbance 
bound £ to be unknown. That is, we replace (A4) with 

(A4') The exogenous disturbance f is uniformly bounded 

sup|£(i)|<£ (5.58) 

t>to 

for some unknown constant £. 

We further relax assumptions (A1)-(A5) by allowing parameters to be slowly 
varying. To this end we introduce the following additional assumption 

(A7) The uncertain parameters are allowed to have slow drifting described by 

||0(O -6(t- 1)|| < a, d t > to (5.59) 

for some constant a > 0. 

Following the results presented in previous sections, we introduce a general- 
ized localization algorithm to tackle the new difficulty. The key feature of the 
algorithm is the use of an on-line estimate of £. This estimate starts with a small 
(or zero) initial value, and is gradually increased when it is invalidated by the 
observations of the output. With the trade-off between a larger number of 
switchings and a higher complexity, the new localization algorithm guarantees 
qualitatively similar properties for the closed loop system as for the case of 
known disturbance bound. 
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Let £(t) be the estimate for £ at time t. Define 

7 0,£M) = {J '■ \9j <!>{*- !) -L(0I < {r> + q)\\<f>(t ~ 1)|| 
+ £(t ~ 1) + v(t ~ l)i y=l,...,L} 



(5.60) 



That is, I(t,£(t)) is the index set of parameter subsets which cannot be falsified 
by any exogenous disturbance sup r >, 0 |£(t)| < £(t — 1). 

Denote the most recent switching instant by s(t). We define s(t) and £(t) as 
follows: 



s(t 0 ) = t 0 (5.61) 

€(to) = 0 (5.62) 



s(t) 

at) 



t = { } and 

s(t — 1) otherwise 

at - 1 ) + if nu«) aa & - 1)) = 

^(t— 1) otherwise 



t- 

{} 



s(t) > td 



and t — 



(5.63) 

s{t) < t d 

(5.64) 



where td is some positive integer representing a length of a moving time interval 
over which validation of a new estimate £(?) is conducted, p is a small positive 
constant representing a steady state residual (to be clarified later), and S(t) is an 
integer function defined as follows: 



6(t) = min 



6: fl I(k,£(k-1)+W¥>{},6€K 

k=s(t ) 



(5.65) 



The main idea behind the estimation scheme presented above is as follows. At 
each time instant when the estimate £(t — 1) is invalidated, that is, 
fU= ? (r) 7 (^; a^ — 1)) — {} we determine the least possible value 6 € N which 
guarantees that no exogenous disturbance sup,>, 0 1£(/)| < (£(i — 1) + 8p) 
would have caused the falsification of all the indices in the current localization 
set. This is done by recomputing the sequence of localization sets over the finite 
period of time [,s( t) , t] whose length is bounded from above by td- Since the 
total number of switchings caused by the ‘wrong’ estimate £(?) is finite and for 
every sufficiently large interval of time the number of switchings due to slow 
parameter drifting can be made arbitrarily small by decreasing the rate of slow 
parameter drifting a it is always possible to choose a sufficiently large td which 
would guarantee global stability of the system. 

The algorithm of localization is modified as follows: 

at) = n‘k=s(t-\Ak,ak)) 

But the switching index i(t) is still defined as in (5.37). 



(5.66) 
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The key properties of the algorithm above are given as follows: 

Theorem 3.4 For any constant p > 0, there exist a parameter drifting bound 
a > 0, a ‘size’ of unmodelled dynamics e > 0 (both sufficiently small), and an 
integer td (sufficiently large), such that the localization algorithm described 
above, when applied to the plant (5.14) with assumptions (A1)-(A3), (A4') and 
(A7), possesses the following properties: 

(1) I(t) f {} for all t > to. 

(2) sup t > t0 £(t) <^+p. 

Subsequently, the following properties hold: 

(3) The closed loop system is globally (£ + p) -exponentially stable, i.e. there 
exist constants M\ > 0, 0 < p < 1, and a function M 2 (-) : R + — * R + with 
M 2 (0) = 0 such that 

||x(OII < M lP ^\\x(t 0 )\\ + M 2 ^+p) (5.67) 

holds for all t > to and x(to). 

(4) The switching sequence {/(to), i(to + 1), • • •} is finitely convergent, i.e. 
i(t) =const, V / > t* for some f if the uncertain parameters are constant. 

Proof See Appendix B. 

We note that even though the value p can be arbitrarily chosen, the estimate 
of the disturbance bound, £(/), can theoretically be larger than £ by the margin 
p. Consequently, the state is only guaranteed to converge to a residual set 
slightly larger than what is given in Theorem 3.4. Our simulation results 
indicate that £(/) very likely converges to a value substantially smaller than £. 
Nevertheless, there are cases where £(/) exceeds £. One possible solution is to 
reduce the value of p. Flowever, a small p may imply a large number of 
potential switchings. 



5.4 Indirect localization principle 

The idea of indirect localization was first proposed in [39] and is based on the 
use of a specially constructed performance criterion as opposed to direct 
localization considered in the previous section. To this end the output of the 
plant is replaced by some auxiliary output observation which is subsequently 
used for the purpose of model falsification. The notion of ‘stabilizing sets’ 
introduced below is central in the proposed indirect localization scheme. We 
first define an auxiliary output, z(f), as 

z(t) = Cx(t), C T G R 2 "- 1 



(5.68) 
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and the inclusion: 

I f :|z(OI<A||*(f-l)||+co (5.69) 

Definition 4.1 1, is said to be a stabilizing inclusion of the system (5.18) if X, 
being satisfied for all t > to and boundedness of £(f)> (£(0 e A»)> implies 
boundedness of the state, x(t), and in particular, there exist ao,/3b and 
a € (0, 1) such that: 

IW0ll<«oa'-'“||x(to)||+/3 o ||6ll C 

Remark 4.1 Note that the inclusion, X, is transformed into a discrete-time 
sliding hyperplane [11] as A — > 0, Co — > 0. In contrast with conventional 
discrete-time sliding mode control we explicitly define an admissible vicinity 
around the sliding hyperplane by specifying the values A > 0 and co > 0. 

Definition 4.2 The uncertain system (5.18) is said to be globally (C, A)- 
stabilizable if 

(1) X, is a stabilizing inclusion of the system (5.18), and 

(2) there exists a control, u(t) = — Kx(t ), such that after a finite time, X, is 
satisfied. 

We will show below that stabilizing sets can be effectively used in the process 
of localization. Before we proceed further we need some preliminary results. 
Assume for simplicity that rj(t) = 0. The case rj(t) f 0 is analysed similarly, 
provided e is sufficiently small. 

Lemma 4.1 Let sup ( >, 0 f (/) < oo, CB > 0. Then there exists 
the system (5.18) is globally (C, 0)-stabilizable if and only if 

|A max (iT4)| < 1 

where 

P= (7 - (CB)-' BCj 

Proof First, suppose that (5.70) is violated, that is 

|A max (iT4)| > 1 (5.72) 

We now show that X, is not a stabilizing inclusion for any co > 0. To do this, 
we take £(t) = 0, and u(t) = — ( CB)~ l CAx(t ). With this control we note from 
(5.18) that z{t) = 0 for t 0, and so foi any co 0, — (/) satisfies (5.69). The 
equation for the closed loop system takes the form 

x(t + 1) = Ax(t) + Bu(t) = PAx(t) (5-73) 

which is not exponentially stable. Therefore, (5.72) implies that there is no co 
such that X, is a stabilizing inclusion. We now establish the converse. Suppose 



a co such that 

(5.70) 

(5.71) 
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(5.70) is satisfied. Then we can rewrite (5.18) as: 



x(t + 1) = PAx(t) + Bu(t) + (BCA)x(t) + E$(t) 

C B 

= PMt) + ^ W + 1) - CEm + E(;(t) 



(5.74) 



From (5.74) it is clear that if z and £ are bounded, then in view of (5.70), x(t) is 
bounded. Therefore, X, is a stabilizing inclusion for any Co- Finally, we take the 
control , 

(5.75) 



which gives 



«(0 = -~^{CA)x{t) 
z (t + 1) = CE£(t) 



(5.76) 



Therefore, for co > \CE\ sup, |£(t)|,Z, is satisfied for all t > 0, and the proof is 
complete. 

Remark 4.2 The control, (5.75), is a ‘one step ahead’ control on the auxiliary 
output, z(t). It then follows that the stability condition (5.70), (5.71) is 
equivalent to the condition that C(z7 — A) -1 B be relative degree one, and 
minimum phase. 

Remark 4.3 If the original plant transfer function, (5.14), is known to be 
minimum phase, and relative degree one then it suffices to take C = E T , and 
the system is then co stabilizable for any <?o > 0. 

If the original plant transfer function is non-minimum phase, then let: 

C=[fo,A ...fn-2,g 0 ,9l 9n- 1] (5-77) 

The transfer function from u(t), via (5.14) to z(t) is then: 
z (t) = P{q)u{t) + G(q)y(t) 

(D{q)F{q) + G{q)N{q)\ (5-78) 

= ( m > (,) 

where F(q) = (/o +f\q H +fn- 2 <f~ 2 ) and G(q) = (g 0 + g x q + . . . + 

Therefore, for a non minimum phase plant, knowledge of a C such that X, is 
a stabilizing inclusion is equivalent to knowledge of a (possible improper) 
controller {u{t) = — G(q)/F(q)y(t)} which stabilizes the system. Because we are 
dealing with discrete-time systems, it is not clear whether this corresponds to 
knowledge of a proper, stabilizing controller for the set. 

Remark 4.4 Because of the robustness properties of exponentially stable 
linear time invariant systems, Lemma 4.1 can easily be generalized to include 
nonzero, but sufficiently small A. 
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Lemma 4.2 Any O which satisfies assumptions (A2) and (A3) has a finite 
decomposition into compact sets: 

L 

= U <h (5.79) 

t= l 

such that for each (, there exists a C/, Ay and co/ such that, for all 
( A,B ) € il ( ,X, is a stabilizing inclusion, and CyB has constant sign. 

Proof (Outline) 

It is well known that (see, e.g., [8]) that fl has a finite decomposition into sets 
stabilized by a fixed controller. From Remark 4.3, the requirements for 
knowledge of a Q such that Z(Q, •, •) is a stabilizing set on O are less stringent 
than knowledge of a stabilizing controller for the set if . 

We now introduce our control method, including the method of localization 
for determining which controller to use. The first case we consider is the 
simplest case where there is a single set to consider. 

Case 1: L = I (sign of CB known) 

This case covers a class of minimum phase plants, plus also certain classes of 
nonminimum phase plants. 

For L = 1 we have: 

S 

= O 1 = (J T2 t - (5.80) 

1=1 



For i = 1 ... s we define a control law: 

u(t)' = - K,-x(t ) = — — F CAjx(t) (5.81) 

C_. D j 

where the plant model. A,-, Bj is in the set fi,-. We require knowledge of a A such 
that: 

||C(^-^(^))||< A; Vi,V(dJ)ea (5.82) 

and 1, is a stabilizing inclusion on 12,- for all i. Note that for any bounded f 1, for 
which we can find a single C which gives C(z7 — A)~ l B minimum phase and 
relative degree one we can always find, for 5 large enough, a A with the 
required properties (see, e.g., [8]). 

At any time t > 0, the auxiliary output z(t + 1)' which would have resulted if 
we applied u(t)‘ = —Kjx(t) to the true plant is, using (5.18). 

z(t + 1) ! — CAx(t) + CBu(ty + CE$(t) 

= z(t + 1) — CB^u(t) — u(t) ,S j 



(5.83) 
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Note that if the true plant is in the set f I,-, then from (5.83) and (5.81) 

z(t+ I)'' = c(a + CE&t) (5.84) 

and, therefore, if the true plant is in f 2,-, then from (5.82), and with c o = \CE£\ 

\z(t + 1) ; | < A||.\'(/)|| + Co (5.85) 

Our proposed control algorithm for Case 1 is as follows (where, without loss of 
generality, we take CB > 0). 



Algorithm C 

1.1 Initialization 
Define 

So = {1,2,..., 4 (5.86) 



1.2 If t> 0, 

If z(t) > A|x(t - 1)| + c 0 then set S, = S,_i - {k, . . . ,j s -uj s } 

If -z(t) > A|x(t - 1)| + c 0 then set S, = S,-\ - {jiji, ■ ■ ■ , k} 
otherwise, S t = S t - 1 - 

where A’,;) ... j s and ^ are integers from the previous time instant (see 
1.4, 1.5). 

1.3 If t> 0, 

For all i € S , , compute u(t)‘ as in (5.81). 

1.4 Order u(t) 1 , i G S, such that: 

u{t) h < u{t) h < ... < u(t) Js (5.87) 



1.5 Apply the ‘median’ control: 



1.6 



u(t) = u{t) k 

where k = j [s/2 \ , 

Then wait for the next sample and return to 1 .2. 



(5.88) 



We then have the following stability result for this control algorithm. 



Theorem 4.1 The control algorithm, (5.86) (5.88), applied to a plant where C 
is known, and where the decomposition (5.80) has the properties that (5.82) is 
satisfied and 1, is a stabilizing inclusion, has the following properties: 



(a) The inclusion: 



X, : \z(t)\ < A||x(f - 1)|| + c 0 



(5.89) 



is violated no more than N = |_log 2 (-s) J times, and 
(b) All signals in the closed loop system are bounded. In particular, there exist 
constants a, (3 < oo,cr e (0, 1) such that all trajectories satisfy, for any 
t 0 ,T>0 

||x(?o + C)|| < aCT r ||x(t 0 )|| + (3 



(5.90) 
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Proof 

(a) Suppose at time (t + 1), (5.89) is violated. This can occur in one of two 
ways which we consider separately: 

(i) 

z(t+ 1) = z(t + 1)'' > A||.\-(t)|| + co (5-91) 

In this case, because of the ordering of w(t)' in (5.87), and the definition of 
z{t + 1)' in (5.83), then 

z(t+ 1)' > A||x(t)|| + c 0 (5.92) 

for all i G {k, . . . ,j s _! 

(ii) 

z(t + 1) = z(t+ 1)'' < -(A||.\-(t)|| + c 0 ) (5-93) 

In this case 

z(t+ 1)' < -(A||x(t)|| + c 0 ) (5.94) 

for all i = {/i ,/ 2 , In either case, we see that if (5.91) is violated at 
time t. then 

s,+ i< 2 S ‘ (5-95) 

from which the result follows. 

(b) First, we note that the control is well defined, that is, S t is never empty. 
This follows since there is at least one index, namely the index of the set fl, 
which contains the true plant, which is always an element of S,. 



Next, we note that although we cannot guarantee that we converge to the 
correct control, from (a) we know (5.85) is satisfied all but a finite number of 
times. 

Since Xj is a stabilizing inclusion, then by definition the states and all signals 
will be bounded. 

Furthermore, since X , is a stabilizing inclusion, there exist a 0 ,A) and 
cr G (0, 1) such that if the inclusion (5.89) is satisfied, for t G [to, to + T), then 

IM ? o + r )ll < aoo- r ||x(t 0 )|| +A> (5.96) 



(Note that if this is not the case, then from the definition, X, is not a stabilizing 
inclusion.) Also, there exist d and (3 such that when (5.89) is violated: 



\\x(t+l)\\<d\\ X (t)\\+p (5.97) 

If we define on = “““ and f3\ = (aod/?o + /3o + oio(3), then after some algebraic 
manipulations we can show that for any to, T > 0 such that (5.89) is violated 
not more than once in the interval, (to, to + T ), then 



IM ? o+ r )ll < 0:1a 1 ||x(to)|| +fi 



Also, we can show that with 012 = 



- 3 

OLOL{)(y.\ CHQIq 



= and fo 



(5.98) 
+ f3 + (y.Q(y.f3i = 



a 




